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ONA-BINDING PYRROLE AND IMIDAZOLE POLYAMIDE DERIVATIVES 
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5 States Government may have certain rights to this 
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25 

FIELD OF THE INVENTION 

This invention relates to the fields of molecular 
biology, biochemistry, and drug design. More 
particularly, the present invention provides synthetic 

30 polyamides containing pyrrole and imidazole amino acids 

which bind specific base pair sequences of double helical 
DNA with affinities and specificities comparable to DNA 
binding proteins such as the transcription factors. A 
series of molecular templates are described which allow 

35 for rational targeting of any predetermined DNA sequence 
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of - therapeutic potential. This non-biological approach to 
DNA recognition provides an underpinning for the design 
of synthetic cell -permeable ligands for the control of 
gene - expr e s s i on . 

5 

BACKGROUND OF THE INVENTION 

In every human cell, genetic information is stored 
on a string- like DNA polymer which is approximately 1 
meter in length and contains 3 x 10 9 units of information 

10 in the form of base pairs, within which is encoded 

approximately 80,000 to 100,000 genes or sets of 
instructions. (Watson, J.D. Gene, 135, 309-315 (1993).) 
The specific interaction of proteins such as 
transcription factors with DNA controls the regulation of 

15 genes and hence cellular processes. (Roeder, R.G. TIBS, 

9, 327-335 (1996).) A wide variety of human conditions 
ranging from cancer to viral infection arise from 
malfunctions in the biochemical machinery that regulates 
gene -express ion. (R. Tjian, Sci . Am., 2, 54-61 (1995).) 

20 Designed small molecules which target specific DNA 
sequences offer a potentially general approach for gene- 
specific regulation. (Gottesfeld, et al . Nature Accepted. 
(1997) . Such molecules could be powerful therapeutics for 
combating life threatening diseases which result from 

25 misregulation in transcription. 

Designed bifunctional small molecules which target 
specific DNA sequences offer a potentially general 
approach for gene-specific, sequence -spec if ic, or 
organism specific modification, detection or capture of 

30 plasmids, genes, cDNA, cosmids, or chromosomes. More 

specifically, a life threatening disease may result from 

9 

a single error within the 3 x 10 units of information 
stored within the double helix. Sequence-specific 
polyamides may discriminate such small errors, hence 
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bifunctional polyamides could have broad diagnostic 
applications which range from determining the molecular 
basis of life threatening diseases to sequence-specific 
visualization of disease genes in living organisms. 

5 The genetic information is in fact, stored on two 

stands of DNA (in antiparallel orientation) in a 
structure termed the double helix. The DNA double helix 
consists of A, T and G,C base pairs held together by 
specific Watson-Crick hydrogen bonds like rungs on a 

10 twisted ladder. (Dickerson, et al . Science, 216, 475 

(1982) . The common B-form of DNA is characterized by a 
wide (12A) and shallow major groove and a deep and narrow 
(4-6 A) minor. Individual sequences may be distinguished 
by the pattern of hydrogen bond donors and acceptors 

15 displayed on the edges of the base pairs. ( Principles of 

Nucleic Acid Structure Sanger, W.; Springer-Verlag, New 
York, 1984.) In the minor groove, the A,T base pair 
presents two symmetrically placed hydrogen bond acceptors 
in the minor groove, the purine N3 and the pyrimidine 02 

20 atoms. The G,C base pair presents these two acceptors, 

but in addition presents a hydrogen bond donor, the 2- 
amino group of guanine (Steitz, T.A. Quart. -Rev. Biophys. 
23, 205) . 

Small molecules isolated from natural sources which 
25 bind DNA are found to be a structurally diverse class, as 

evidenced by consideration of four representative 
molecules, chromomycin, distamycin, actinomycin D, and 
calicheamicin. (Gao, et al . J. Mol . Biol. 223, 259-279. 
(1992); Kamitori, et al . J. Mol. Biol. 225, 445-456 
30 (1992); Paloma, et al . J. Am. Chem. Soc. 116, 3697-3708 

(1994); Coll, et al. Proc. Natl. Acad. Sci. U.S.A. 84, 
8385-8389 (1987)). There is no simple natural recognition 
code for the readout of specific sequences of DNA. 
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The structures of four small molecules isolated from 
natural sources are shown in Figure 1 . Among these DNA- 
binding molecules, distamycin is distinguished by its 
structural simplicity, having no chiral centers and an 
oligopyrrolecarboxamide core structure. (Zimmer, C. 
Prog. Nucleic Acid Res. Mol . Biol. (1975) 15, 285; 
Baguley, B.C. Molecular and Cellular Biochemistry (1982) 
43, 167-181; Zimmer, et al . , Prog. Biophy. Mol. Biol. 47, 
31 (1986)). Structural studies of distamycin-DNA 
complexes reveal modular complexes in which adjacent 
pyrrolecarboxamides makes similar contacts with adjacent 
DNA base pairs. The relative simplicity of distamycin, 
with respect both to its chemical structure and its 
complexes with DNA, guided the initial decision to use 
distamycin as a basis for designed polyamides having 
novel DNA-binding sequence specificity. (Dervan, P.B. 
Science 232, 464-471 (1986).) 

A schematic representation of recognition of A, T 
rich sequences in the minor groove by Distamycin is shown 



Two distinct DNA binding modes exist for Distamycin 
A. In the first binding mode, a single molecule of 
Distamycin binds in the middle of the minor groove of a 5 



20 



below: 




2:1 Distamycin-DNA Complex 
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base pair A,T rich sequence. The amide hydrogens of the 
N-methylpyrrole-carboxamides form bifurcated hydrogen 
bonds with Adenine N3 and thymine 02 atoms on" the floor 
of the minor groove. 10 In the second binding mode, 2 
distamycin ligands form an antiparallel side-by- side 
dimer in the minor groove of a 5 base pair A,T rich site. 
(Pelton, J.G. Sc Wemmer, D.E. (1989) Proc. Natl. Acad. 
Sci. 86, 5723-5727; Pelton, J.G. & Wemmer, D.E. (1990) J. 
Am. Chem. Soc. 112, 1393-1399; Chen, et al . (1994). 
Nature Struct. Biol. 1, 169-175.) In the 2:1 model each 
polyamide subunit forms hydrogen bonds to a unique DNA 
strand in the minor groove. 

Polyamides containing tf-methylpyrrole (Py) and N- 
methyl imidazole (Im) amino acids provide a model for the 
design of artificial molecules for recognition of double 
helical DNA. For side-by-side complexes of Py/Im- 
polyamides in the minor groove of DNA, the DNA binding 
sequence specificity depends on the sequence of side-by- 
side amino acid pairings. (Wade, et al . (1992). J. Am. 
Chem. Soc. 114, 8783-8794; Mrksich, et al . (1992). Proc. 
Natl. Acad. Sci. U.S.A. 89, 7586-7590; Wade, W.S., 
Mrksich, M. & Dervan, P.B. (1993); Biochemistry 32, 
11385-11389 (1993)). A pairing of Im opposite Py targets 
a G*C base pair while a pairing of Py opposite Im targets 
a OG base pair. A Py/Py combination is degenerate 
targeting both A*T and T«A base pairs. Specificity for 
G,C base pairs results from the formation of a putative 
hydrogen bond between the imidazole N3 and the exocyclic 
amine group of guanine. Validity of the pairing rules is 
supported by a variety of footprinting and NMR structure 
studies. (Mrksich, et al., J. Am. Chem. Soc, 115, 2572 
(1993); Geierstanger, et al . Science, 266, 646 (1994); 
Mrksich et al . , J. Am. Chem. Soc, 117, 3325 (1995).) 
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A schematic representation of the polyamide pairing 
rules is shown below: 



10 



15 



20 




Py / Im targets OG 
Py / Py targets A*T and T»A 
Im / Py targets G<? 

y y 

In parallel with the elucidation of the scope and 
limitations of the pairing rules, efforts have been made 
to increase the DNA-binding affinity and specificity of 
pyrrole -imidazole polyamides by covalently linking 
polyamide subunits. (Mrksich, M. & Dervan, p.B. (1993). 
J. Am. Chem. Soc. 115, 9892-9899; Dwyer, et al . (1993). 
J. Am. Chem. Soc. 115, 9900-9906; Mrksich, M. & Dervan, 
P.B. (1994). J. Am. Chem. Soc. 116, 3663-3664; Chen, Y.H. 
and Lown, J.W. (1994) J. Am. Chem. Soc. 116, 6995-7005. 
Chen, Y.H. and Lown, J.W. Heterocycles 41, 1691-1707 
(1995). Geierstanger, et al . , Nature Structural Biology, 
3, 321 (1996). Chen, et al . J. Biomol. Struct. Dyn. 14, 
341-355 (1996); Cho, et al . Proc. Natl. Acad. Sci. USA, 
92, 10389 (1995)). A simple hairpin polyamide motif with 
y-aminobutyric acid (y) serving as a turn-specific 
internal -guide -residue provides a synthetically 
accessible method of linking polyamide subunits within 
the 2:1 motif. The head- to- tail linked polyamide ImPyPy-y- 
PyPyPy-dimethylaminopropylamide (Dp) was shown to 
specifically bind the designated target site 5'-TGTTA-3' 
with an equilibrium association constant of Ka = 8 x lo 7 
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M* 1 , an increase of 300-fold relative to the unlinked 
three-ring polyamide pair ImPyPy and PyPyPy. (Mrksich, et 
al. J. Am. Chem, Soc. 116, 7983-7988). The hairpin 
polyamide model is supported by f ootprinting, affinity 
5 cleaving and NMR structure studies. (Church, et al . 

Biochemistry 1990, 29, 6827; He, et al . J. Am. Chem. Soc. 
1993, 115, 7061; de Clairac, et al . J. Am. Chem. Soc. 
submitted. ) 

A schematic representation of recognition of a 5'- 
10 TGTTA-3 ' sequence by unlinked subunits (left) and y- 

aminobutyric acid linked subunits (right) is shown below: 




ImPyPy-Dp / PyPyPy • TGTTA ImPyPy-y-PyPyPy-Dp • TGTTA 

K a ^2xl0 5 M 1 K a = 8xl0 7 M- 1 



Closing the ends of the hairpin to form a cyclic 
polyamide increases the overall energetics for DNA- 

15 binding presumably by restricting conformational space 

for the molecule. (Lown, J.W. and Krowicki, K. J. Org. 
Chem. 1985, 50, 3774.) A cyclic polyamide cyclo- (ImPyPy-y- 
PyPyPy-y-) was shown to specifically bind the designated 
target site 5' -TGTTA-3' with an equilibrium association 

20 constant of = 2 . 9 x 10 9 NT 1 , an increase of 40- fold 

relative to the corresponding hairpin polyamide of 
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sequence composition ImPyPy-y-PyPyPy . The sequence- 
specificity versus single base pair mismatch sites drops 
from 30- fold for the hairpin polyamide to 2 -fold for the 
cyclic polyamide. 

A schematic representation of a cyclic polyamide 
recognizing the minor groove is shown below: 




specificity: 30-fold sperifitity: 2-fold 



Despite the design breakthrough in molecular 
recognition of DNA, the binding affinities of linked and 
unlinked polyamide dimers of the prior art are modest 
when compared to those found with natural DNA binding 
proteins. (Clemens, et al . J". Mol . Biol. 244, 23-35 
(1994)). For example DNA-binding transcription factors 
recognize their cognate sites at subnanomolar 
concentrations. (Jamieson, et al . Biochemistry 33, 5689- 
5695 (1994); Choo, Y. and Klug, A. Proc.Natl. Acad. Sci. 
U.S.A. 91, 11168-11172 (1994); Greisman, H.A. and Pabo, 
CO. Science 275, 657-661 (1997)). Six-ring hairpin 
polyamides require concentrations greater than 10 nM to 
occupy their target sites. The only class of polyamides 
described in the prior art with affinities similar to 
DNA-binding proteins are the 6 -ring cyclic polyamides; 
however, this class of molecules lacks the sequence- 
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specificity of proteins (i.e. an energetic penalty for 
binding a single base pair mismatch site) and therefore 
currently has no potential for therapeutic applications. 

Two prior approaches for the development of 
5 synthetic transcriptional antagonists have been reported. 

Oligodeoxynucleotides which recognize the major groove of 
double helical DNA via triple helix formation bind a 
broad sequence repertoire with high affinity and 
specificity (Moser, H. E. & Dervan, P. B. Science 238, 

10 645-650 (1987); Thuong, et al . Angew. Chem. Int. Ed. 

Engl. 32, 666-690 (1993)). Although oligonucleotides and 
their analogs have been shown to interfere with gene 
expression (Maher, et al . Biochemistry 31, 70-81 (1992); 
Duvalvalentin, et al . Proc . Natl. Acad. Sci . U.S.A. 89, 

15 504-508 (1992)). The triple helix approach is limited to 

purine tracks and suffers from poor cellular uptake. 
There are a few examples of cell -permeable carbohydrate 
based ligands that interfere with transcription factor 
function. (Ho, et al . Proc. Natl. Acad. Sci. USA 91, 

20 9 2 0 3 - 92 0 7 (19 94); Liu, C. et al . Proc. Natl. Acad. Sci. 

USA 93, 940-944 (1996)). However oligosaccharides are not 
yet amenable to recognition of a broad range of 
predetermined DNA sequences . 

Because of the small size and hydrophobic nature of 

25 polyamides (MW * 1200) and because the parent ligand 

Distamycin is itself cell -permeable these ligands have 
the potential to underpin a new field of small molecule 
regulation of gene expression. It remained to be 
determined if low molecular weight (MW « 1200) pyrrole - 

30 imidazole polyamides could be constructed which recognize 

predetermined DNA sites at subnanomolar concentrations 
without compromising sequence-selectivity. 
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SUMMARY OF THE INVENTION 

This invention provides improved polyamides for 
selectively binding a DNA molecule. Compounds of the 
present invention comprise a polyamide of the formula: 




where R 1 , R a , R b , R e , R f , R 1 , R\ R n , and R° are chosen 
independently from H, Cl, NO, N-acetyl, benzyl, d- 6 
alkyl, Ci- 6 alkylamine, Ci-<s alkyldiamine, Ci-6 
alkylcarboxylate, d- 6 alkenyl, and d. 6 alkynyl; 

R 2 is selected from the group consisting of H, NH 2 , 
SH, Cl, Br, F, N-acetyl, and N-formyl; 

R 3 , R d , R 1 and R q are selected independently from the 
group consisting of H, NH 2 , OH, SH, Br, Cl , F, OMe, CH 2 OH, 
CH 2 SH, CH 2 NH 2 ; 

R 4 is -NH(CH 2 ) 0 - 6 NR 5 R € or NH(CH 2 ) r CO NH (CH 2 ) 0 . 6 NR 5 R 6 or 
NHR 5 or NH ( CH 2 ) r CONHR 5 , where R 5 and R 6 are independently 
chosen from H, Cl, NO, N-acetyl, benzyl, C a - fi alkyl, d- 6 
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alkylamine, Ci- 6 alkyldiamine , Ci_ 6 alkylcarboxylate, d- 6 
alkenyl, Ci_ 6 L, where L groups are independently chosen 
from biotin, oligodeoxynucleotide, N-ethylnitrosourea, 
fluorescein , bromoacetamide , iodoace tamide , DL-a- 1 ipoic 
acid, acridine, ethyl red, 4- (psoralen- 8 -yloxy) -butyrate, 
tartaric acid, ( + ) -a-tocopheral , and Ci_ 6 alkynyl, where r 
is an integer having a value ranging from 0 to 6; 

X, X a , X b , X e , X f , X 1 , X j , X n , X° are chosen 
independently from the group consisting of N, CH, COH, 
CCH 3 , CNH 2 , CC1, CF; and 

a, b, c, d, e, f, i, j, k, and m are integers chosen 
independently, having values ranging from 0 to 5; 
or a pharmaceutically acceptable salt thereof. 

The invention further comprises a polyamide having 
the formula : 




where R 1 , R ati - m) and R b ^- m) are chosen independently from H, 
CI, NO, N-acetyl, benzyl, Ci- 6 alkyl, Ci- 6 alkylamine, Ci- 6 
alkyldiamine, Ci- 6 alkylcarboxylate, Ci- 6 alkenyl, and Ci- 6 
alkynyl; 
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R 2 is selected from the group consisting of H, NH 2/ 
SH, CI, Br, F, N-acetyl, and N-formyl; 

R f(m) and R c<k ' m) are selected independently from the 
group consisting of H, NH 2 , OH, SH, Br, CI, F, OMe, CH 2 OH, 
CH 2 SH, CH 2 NH 2 ; 

R 4 is -NH(CH 2 ) 0 , 6 NR 5 R 6 or NH(CH 2 ) r CO NH (CH 2 ) 0 - 6 NR 5 R 6 or 
NHR 5 or NH(CH 2 ) r CONHR 5 , where R 5 and R 6 are independently 
chosen from H, CI, NO, N-acetyl, benzyl, d_ 6 alkyl, Cx- 6 
alkylamine, C L . 6 alkyldiamine, Ci- 6 alkylcarboxylate, Ci. 6 
alkenyl, Ci- 6 L, where L groups are independently chosen 
from biotin, oligodeoxynucleotide, N-ethylnitrosourea, 
fluorescein, bromoacetamide, iodoacetamide , DL-a-lipoic 
acid, acridine, ethyl red, 4- (psoralen- 8 -yloxy) -butyrate, 
tartaric acid, ( + ) -a-tocopheral , and C x . $ alkynyl, where 
r is an integer having a value ranging from 0 to 6; 

X, x a(i ' m) and X b(j ' m) are chosen independently from the 
group consisting of N, CH, COH, CCH 3/ CNH 2/ CC1, CF; and 

a, b, c, d, e, f, g, h, i, j, k, 1, m, n, o and p 
are integers chosen independently, having values ranging 
from 0 to 5; 

or a pharmaceutically acceptable salt thereof. 

By tt alkyl" or "lower alkyl" in the present invention 

is meant Ci-C 6 alkyl, i.e., straight or branched chain 
alkyl groups having 1-6 carbon atoms, such as, for 
example, methyl, ethyl, propyl, isopropyl, n-butyl, sec- 
butyl, tert-butyl, pentyl, 2-pentyl, isopentyl, 
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neopentyl, hexyl, 2 -hexyl, 3 -hexyl, and 3 -methylpentyl . 
Preferred C1-C6 alkyl groups are methyl, ethyl, propyl, 
butyl, cyclopropyl or cyclopropylmethyl . Particularly 
preferred are Ci-C alkyl groups such as methyl, ethyl, 
and propyl . 

BRIEF DESCRIPTION OF THE FIGURES 

FIGURE 1. Small molecules isolated from natural sources. 

FIGURE 2. Hairpin polyamides. 

FIGURE 3. Chemical structures of polyamides. 

FIGURE 4. Solid phase synthesis of polyamides. 

FIGURE 5. Extended hairpin polyamides. 

FIGURE 6. Association profile of extended hairpin 

polyamides . 
FIGURE 7. Binding models for polyamides. 

FIGURE 8. Schematic binding models for eight ring hairpin 
poly amide . 

FIGURE 9. Eight-residue hairpin polyamides. 
FIGURE 10. Structure of 4-p-4 polyamides. 
FIGURE 11 •Recognition of DNA by 4~(5-4 polyamides 
FIGURE 12. Placement of p/p pairs. 

FIGURE 13. p-linked fully overlapped polyamide complexes. 
FIGURE 14. 10 -ring hairpin polyamides. 

FIGURE 15 .Discrimination of seven base pair sequence by 
polyamides . 

FIGURE 16. Hairpin polyamides that recognize seven base 

pair sequence. 
FIGURE 17.Dnase I footprint titration. 
FIGURE 18. Ni (II) -Gly-Gly-His modified polyamide. 
FIGURE 19 .Bromoacetylated hairpin polyamide. 
FIGURE 20. Structure of ( + ) CO1065 and duocarmycins . 
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FIGURE 21.Alkylation mechanism of CC-1065. 
FIGURE 22. Structure of Bizlesin and CBI . 
FIGURE 2 3. Synthesis of CBI-polyamide conjugate. 
FIGURE 24. Synthesis of bifunctional methidium-polyamide 
conjugates. 

FIGURE 25. Synthesis of polyamide-rhodamine conjugate. 

FIGURE 26. Structure of polyamide-DYE conjugates. 

FIGURE 27. Synthesis of biotin-polyamide conjugates. 

FIGURE 28. Bifunctional biotin-polyamide conjugates. 

FIGURE 29. Affinity capture using bifunctional biotin- 
polyamide conjugates. 

FIGURE 30.Psoralen-polyamide conjugate. 

FIGURE 31. Cooperative dimerization of polyamides. 

FIGURE 32. Binding of polyamides to mismatched sites. 

FIGURE 33. Footprint titration of polyamides. 

FIGURE 34.Generalizable polyamide motifs. 

FIGURE 35. Examples of polyamides. 

FIGURE 36 .Determination of polyamide affinity. 

FIGURE 37. N- terminally extended polyamides. 

FIGURE 38. Polyamides binding 16 base pair sequence. 

FIGURE 39. Determination of 16 base pair sequence. 

FIGURE 40 .Binding of polyamides to mismatched sites. 

FIGURE 41. P-substitution in polyamides. 

FIGURE 42. Affinity determinations for p-substituted 
polyamides . 

FIGURE 43. Binding of polyamides to TATA box. 

DETAILE D DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Within this application, unless otherwise stated, 
definitions of the terms and illustration of the 
techniques of this application may be found in any of 
several well-known references such as: Sambrook, J., et 
al., Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press (1989); Goeddel, D., ed. , Gene 
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Expression Technology, Methods in Enzymology, 185, 
Academic Press, San Diego, CA (1991) ; "Guide to Protein 
Purification'' in Deutshcer, M.P., ed., Methods in 
Enzymology, Academic Press, San Diego, CA (1989) ; Innis, 

5 et al . , PCR Protocols: A Guide to Methods and 

Applications, Academic Press, San Diego, CA (1990) ; 
Freshney, R.I., Culture of Animal Cells: A Manual of 
Basic Technique, 2 nd Ed., Alan Liss, Inc. New York, NY 
(1987); Murray, E.J'. , ed., Gene Transfer and Expression 

10 Protocols, pp. 109-128, The Humana Press Inc., Clifton, 

NJ and Lewin, B., Genes VI, Oxford University Press, New 
York (1997) . 

For the purposes of this application, a promoter is 
a regulatory sequence of DNA that is involved in the 

15 binding of RNA polymerase to initiate transcription of a 

gene. A gene is a segment of DNA involved in producing a 
peptide, polypeptide or protein, including the coding 
region, non-coding regions preceding ("leader") and 
following ("trailer") the coding region, as well as 

20 intervening non-coding sequences ("introns") between 

individual coding segments ("exons"). Coding refers to 
the representation of amino acids, start and stop signals 
in a three base "triplet" code. Promoters are often 
upstream ( n '5 to") the transcription initiation site of 

25 the corresponding gene. Other regulatory sequences of 

DNA in addition to promoters are known, including 
sequences involved with the binding of transcription 
factors, including response elements that are the DNA 
sequences bound by inducible factors. Enhancers comprise 

30 yet another group of regulatory sequences of DNA that can 

increase the utilization of promoters, and can function 
in either orientation (5' -3' or 3' -5') and in any 
location (upstream or downstream) relative to the 
promoter. Preferably, the regulatory sequence has a 
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positive activity, i.e., binding of an endogeneous ligand 
(e.g. a transcription factor) to the regulatory sequence 
increases transcription, thereby resulting in* increased 
expression of the corresponding target gene. In such a 
case, interference with transcription by binding a 
polyamide to a regulatory sequence would reduce or 
abolish expression of a gene. 

The promoter may also include or be adjacent to a 
regulatory sequence known in the art as a silencer, A 
silencer sequence generally has a negative regulatory 
effect on expression of the gene. In such a case, 
expression of a gene may be increased directly by using a 
polyamide to prevent binding of a factor to a silencer 
regulatory sequence or indirectly, by using a polyamide 
to block transcription of a factor to a silencer 
regulatory sequence. 

It is to be understood that the polyamides of this 
invention bind to double stranded DNA in a sequence 
specific manner. The function of a segment of DNA of a 
given sequence, such as 5 ' -TATAAA-3 ' , depends on its 
position relative to other functional regions in the DNA 
sequence. In this case, if the sequence 5' -TATAAA-3' on 
the coding strand of DNA is positioned about 30 base 
pairs upstream of the transcription start site, the 
sequence forms part of the promoter region (Lewin, Genes 
VI, pp. 831-835). On the other hand, if the sequence 5'- 
TATAAA-3 ' is downstream of the transcription start site 
in a coding region and in proper register with the 
reading frame, the sequence encodes the tyrosyl and lysyl 
amino acid residues (Lewin, Genes VI, pp. 213-215) . 

While not being held to one hypothesis, it is 
believed that the binding of the polyamides of this 
invention modulate gene expression by altering the 
binding of DNA binding proteins, such as RNA polymerase, 
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transcription factors, TBF, TFIIIB and other proteins. 
The effect on gene expression of polyamide binding to a 
segment of double stranded DNA is believed to be related 
to the function, e.g., promoter, of that segment of DNA. 

It is to be understood by one skilled in the art 
that the improved polyamides of the present invention may 
bind to any of the above -described DNA sequences or any 
other sequence having a desired effect upon expression of 
a gene. In addition, U.S. Patent No. 5,578,444 describes 
numerous promoter targeting sequences from which base 
pair sequences for targeting an improved polyamide of the 
present invention may be identified. 

It is generally understood by those skilled in the 
art that the basic structure of DNA in a living cell 
includes both major and a minor groove. For the purposes 
of describing the present invention, the minor groove is 
the narrow groove of DNA as illustrated in common 
molecular biology references such as Lewin, B. , Genes VI, 
Oxford University Press, New York (1997) . 

To affect gene expression in a cell, which may 
include causing an increase or a decrease in gene 
expression, a effective quantity of one or more polyamide 
is contacted with the cell and internalized by the cell. 
The cell may be contacted in vivo or in vitro. Effective 
extracellular concentrations of polyamides that can 
modulate gene expression range from about 10 nanomolar to 
about 1 micromolar. Gottesfeld, J.M. , et al . , Mature 387 
202-205 (1997) . To determine effective amounts and 
concentrations of polyamides in vitro, a suitable number 
of cells is plated on tissue culture plates and various 
quantities of one or more polyamide are added to separate 
wells. Gene expression following exposure to a polyamide 
can be monitored in the cells or medium by detecting the 
amount of the protein gene product present as determined 
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by various techniques utilizing specific antibodies, 
including ELISA and western blot. Alternatively, gene 
expression following exposure to a polyamide can be 
monitored by detecting the amount of messenger RNA 
present as determined by various techniques, including 
northern blot and RT-PCR. 

Similarly, to determine effective amounts and 
concentrations of polyamides for in vivo administration, 
a sample of body tissue or fluid, such as plasma, blood, 
urine, cerebrospinal fluid, saliva, or biopsy of skin, 
muscle, liver, brain or other appropriate tissue source 
is analyzed. Gene expression following exposure to a 
polyamide can be monitored by detecting the amount of the 
protein gene product present as determined by various 
techniques utilizing specific antibodies, including ELISA 
and western blot. Alternatively, gene expression 

following exposure to a polyamide can be monitored by the 
detecting the amount of messenger RNA present as 
determined by various techniques, including northern blot 
and RT-PCR. 

The polyamides of this invention may be formulated 
into diagnostic and therapeutic compositions for in vivo 
or in vitro use. Representative methods of formulation 
may be found in Remington: The Science and Practice of 
Pharmacy, 19th ed., Mack Publishing Co., Easton, PA 
(1995) . 

For in vivo use, the polyamides may be incorporated 
into a physiologically acceptable pharmaceutical 
composition that is administered to a patient in need of 
treatment or an animal for medical or research purposes. 
The polyamide composition comprises pharmaceutical ly 
acceptable carriers, excipients, adjuvants, stabilizers, 
and vehicles. The composition may be in solid, liquid, 
gel, or aerosol form. The polyamide composition of the 
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present invention may be administered in various dosage 
forms orally, parentally, by inhalation spray, rectally, 
or topically. The term parenteral as used herein 
includes , subcutaneous , intravenous , intramuscular, 
5 intrasternal , infusion techniques or intraperitoneally . 

The selection of the precise concentration, 
composition, and delivery regimen is influenced by, inter 
alia, the specific pharmacological properties of the 
particular selected compound, the intended use, the 

10 nature and severity of the condition being treated or 

diagnosed, the age, weight, gender, physical condition 
and mental acuity of the intended recipient as well as 
the route of administration. Such considerations are 
within the purview of the skilled artisan. Thus, the 

15 dosage regimen may vary widely, but can be determined 

routinely using standard methods. 

Polyamides of the present invention are also useful 
for detecting the presence of double stranded DNA of a 
specific sequence for diagnostic or preparative purposes. 

20 The sample containing the double stranded DNA can be 

contacted by polyamide linked to a solid substrate, 
thereby isolating DNA comprising a desired sequence. 
Alternatively, polyamides linked to a suitable detectable 
marker, such as biotin, a hapten, a radioisotope or a dye 

25 molecule, can be contacted by a sample containing double 

stranded DNA. 

The design of bifunctional sequence specific DNA 
binding molecules requires the integration of two 
separate entities: recognition and functional activity. 
30 Polyamides that specifically bind with subnanomolar 

affinity to the minor groove of a predetermined sequence 
of double stranded DNA are linked to a functional 
molecule, providing the corresponding bifunctional 
conjugates useful in molecular biology, genomic 



19 



WO 98/49142 



PCT/US98/06997 



sequencing, and human medicine. Polyamides of this 

invention can be conjugated to a variety of functional 
molecules, which can be independently chosen from but is 
not limited to arylboronic acids, biotins, polyhistidines 
5 comprised from about 2 to 8 amino acids, haptens to which 

an antibody binds, solid phase supports, 

oligodeoxynucleotides, N-ethylnitrosourea, fluorescein, 
bromoacet amide, iodoacetamide, DL-a-lipoic acid, 
acridine, captothesin, pyrene, mitomycin, texas red, 

10 anthracene, anthrinilic acid, avidin, DAPI, isosulfan 

blue, malachite green, psoralen, ethyl red, 4- (psoraen-8- 
yloxy) -butyrate, tartaric acid, (+) -a-tocopheral , 
psoralen, EDTA, methidium, acridine, Ni (II) •Gly-Gly-His, 
TO, Dansyl, pyrene, N-bromoacetamide , and gold particles. 

15 Such bifunctional polyamides are useful for DNA affinity 

capture, covalent DNA modification, oxidative DNA 
cleavage, DNA photocleavage . Such bifunctional polyamides 
are useful for DNA detection by providing a polyamide 
linked to a detectable label. DNA complexed to a labeled 

20 polyamide can then be determined using the appropriate 

detection system as is well known to one skilled in the 
art. For example, DNA associated with a polyamide linked 
to biotin can be detected by a streptavidin / alkaline 
phosphatase system. 

25 The present invention also describes a diagnostic 

system, preferably in kit form, for assaying for the 
presence of the double stranded DNA sequence bound by the 
polyamide of this invention in a body sample, such brain 
tissue, cell suspensions or tissue sections, or body 

30 fluid samples such as CSF, blood, plasma or serum, where 

it is desirable to detect the presence, and preferably 
the amount, of the double stranded DNA sequence bound by 
the polyamide in the sample according to the diagnostic 
methods described herein. 
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The diagnostic system includes, in an amount 
sufficient to perform at least one assay, a specific 
polyamide as a separately packaged reagent. Instructions 
for use of the packaged reagent (s) are also typically 
included. As used herein, the term "package" refers to a 
solid matrix or material such as glass, plastic (e.g., 
polyethylene, polypropylene or polycarbonate) , paper, 
foil and the like capable of holding within fixed limits 
a polyamide of the present invention. Thus, for example, 
a package can be a glass vial used to contain milligram 
quantities of a contemplated polyamide or it can be a 
microliter plate well to which microgram quantities of a 
contemplated polypamide have been operatively affixed, 
i.e., linked so as to be capable of being bound by the 
target DNA sequence. "Instructions for use" typically 
include a tangible expression describing the reagent 
concentration or at least one assay method parameter such 
as the relative amounts of reagent and sample to be 
admixed, maintenance time periods for reagent or sample 
admixtures, temperature, buffer conditions and the like. 
A diagnostic system of the present invention preferably 
also includes a detectable label and a detecting or 
indicating means capable of signaling the binding of the 
contemplated polyamide of the present invention to the 
target DNA sequence. As noted above, numerous detectable 
labels, such as biotin, and detecting or indicating 
means, such as enzyme-linked (direct or indirect) 
streptavidin, are well known in the art. 

Trauger,et al . (Mature, 382: 559-561) and Swalley, 
et al. (J. Am. Chem. Soc. 119: 6953-6961) have described 
recognition of DNA by certain polyamides at subnanomolar 
concentrations. Pairing specific carboxyamide groups 
allows for recognition of specific DNA sequences 
(Swalley, et al . supra). Polyamides comprising Hp, Im, 
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and Py provide for coded targeting of pre -determined DNA 
sequences with high affinity and specificity. Im and Py 
polyamides may be combined to form Im/Py, Py/Im, Py/Py 
binding pairs which complement the four Watson-Crick base 
pairs A, C, G, and T. Table 1 illustrates such pairings. 
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TABLE 1 

Pairing Codes for Base Pair Recognition* 

Pair G»C OG T*A A«T 

Im/Py + - - 

Py/Im + 

Im/P + 

P/Im + 

Py/Py + + 



* favored (+), disfavored (-) 
5 The basic polyamide pairing rules of the prior art 

are insufficient for design of ligands recognizing target 
sites having subnanomolar binding affinities. Additional 
second generation rules for polyamide design are provided 
herein . Each additional rule alone may not be sufficient 

10 for design of polyamides with subnanomolar affinity. 

However, simultaneous application of the second 
generation design rules provided herein allows for the 
construction of a number of versatile molecular templates 
for polyamide design. 

15 It has been found that a hairpin polyamide 

synthesized from Boc-p- alanine -Pam- Resin, ImPyPy-y-PyPyPy- 
P-Dp binds with both enhanced affinity and specificity 
relative to the parent compound, ImPyPy-y-PyPyPy-Dp, which 
lacks the C- terminal p-alanine residue. (Optimization of 

20 the Hairpin Polyamide Design for Recognition of the Minor 

Groove of DNA, M. E. Parks, E. E. Baird and P. B. 
Dervan, J. Am. Chem. Soc, 118, 6147 (1996).) More 
specifically ImPyPy-y-PyPyPy-p-Dp binds with an apparent 
first order association constant, = 3 x 10 8 M" 1 , a 

25 factor of four greater than the parent polyamide, ImPyPy- 

y-PyPyPy-Dp, Ka = 8 x 10 7 M" X . Furthermore, ImPyPy-y-PyPyPy- 
p-Dp binds the target 5'-TGTTA-3' match site with 60-fold 
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specificity relative to a single base pair 5'-TGACA-3' 
mismatch site. This can be compared with the parent 
polyamide ImPyPy-y-PyPyPy-Dp which has a 24 -fold specific 
binding relative to the same two DNA sites. The modest 
increased binding affinity of the C-terminal p-alanine 
polyamide, may result from an additional hydrogen bond 
between the P-alanine carboxamide and a 1 sixth' base pair 
of the binding site. 

Three or four-ring improved polyamides of the 
present invention are covalently coupled to form six or 
eight -ring structures, respectively, that bind 
specifically to four or six base pair targets, 
respectively, at subnanomolar concentrations. As such, 
the improved polyamides of the present invention may be 
directed to any DNA sequence comprised of A, C, G, or T. 

In one embodiment, the present invention comprises 
improved polyamides having three or four- ring polyamide 
structures covalently coupled to form six or eight -ring 
hairpin structures, respectively, of the general 
structures I - XXVIII: 



Xj X2X3X4YX5X6X7X8 
I 

X 1 X2X3 PX4X5X6 

III 

XIX2X3X4X5PX6X7X8 
V 

X i X2X3 PX4X5X6X7X8 
VII 



X 1 X2X3X4X5YX6X7X8X9X 10 
II 

Xi X2X3X4PX5X6X7X8 
IV 

X 1 X2X3X4PX5X6X7 
VI 

X1X2X3X4PX5X6X7X8 
VIII 



X 1 X2X3X4X5 PX6X7X8X9X ] 0 
IX 

XiX2X3X4X5PyX6X7XgX9Xio 



X| X2X3X4X5X6yX7X8X9Xi(>Xn X 12 

X 

X 1 pX 2 X3YX 4 pX 5 X 6 
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XI 

X 1 X 2 px 3 X4yX5X 6 pX 7 X 8 
XII 

X,X2X3pX 4 X5YX 6 X7X 8 pX 9 X l o 
XV 

X, pX 2 X3X4X5yX6pX 7 X 8 X9X,o 



XII 

X,X2PX3X4X 5 yX 6 X7X8PX 9 Xio 
XIV 

X,X 2 X3X4pX5yX 6 pX 7 X8X9X 1 o 

XVI 

X I X2X3X4pX 5 yX 6 X7X 8 X 9 pX 1 o 



10 



15 



XVII 

X|X 2 X 3 pX4XjyX 6 X7pX 8 X9X,o 
XIX 

X, pX 2 X3X 4 X 5 yX <i X7X 8 X 9 pX 1 o 
XXI 

X|X 2 X3pX4X5XePX7X 8 X9 
XXIII 

XiX 2 X3yX4X$X6PX7X 8 X 9 
XXV 

XiX 2 X3yX4X5XsGX7X 8 X 9 
XXVII 



XVIII 

X, X 2 pX3X 4 X5YX 6 X 7 pX 8 X 9 X,o 
XX 

X.X2PX3X4PX5X6PX7X8 

XXII 

X)X 2 X3X4PXsPX6X7X8X 9 

XXIV 

X,X 2 X3yX4X5X 6 pX 7 X 8 X 9 px,oX„X, 2 
XXVI 

XiX 2 X3X4yXsX$X7X 8 pX 9 XioX| iXi 2 

xxvni 
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where Xi-i 2 is a substituted imidazole such as N- 
methylimidazolecarboxamide (Im) , or a substituted pyrrole 
such as N-methylpyrrolecarboxamide (Py) . An improved 
polyamide of the present invention may also include a C- 
terminal aliphatic amino acid such as a P-alanine residue 
(P) joined to an amide group such as 
dimethylaminopropylamide (Dp) . In addition, an improved 
polyamide of the present invention may further include a 
aliphatic amino acid such as p-alanine residue (P) or 
glycine (G) , an amide group such as 

dimethylaminopropylamide (Dp) , an alcohol such as EtOH, 
an acid such as ethylenediaminetetraacetic acid (EDTA) , 
or any derivative thereof joined to the y-aminobutyric 
acid (y) residue. 
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The use of p-alanine in the synthetic methods 
provides aromatic/aliphatic pairing (Im/P, P/lm, Py/p, 
and P/Py) and aliphatic/aliphatic pairing (P/P) 
substitution. The use of y-aminobutyric acid, or a 
substituted y-aminobutyric acid such as (R)-2,4 
diaminobutyric acid, provides for preferred hairpin 
turns. Many other groups suitable for the purposes of 
practicing this invention are well known and widely 
available to one skilled in the art. 

The polyamide subunit structures I -XXVI I I above, and 
XXIX below may be covalently coupled through the y residue 
which represents a -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin linkage 
derived from y-aminobutyric acid or a chiral hairpin 
linkage derived from R-2 , 4 -diaminobutyric acid. The 
present invention provides the reagents and methodologies 
for substituting the y-residue of certain polyamides with 

a moiety such as (R) -2, 4 , -diaminobutyric acid ((i?) H2N y). 
The NMR structure of a hairpin polyamide of sequence 
composition ImPyPy-y-PyPyPy complexed with a 5'-TGTTA-3' 
target site indicated that it was possible to substitute 
the a-position of the y-aminobutyric acid residue within 
the hairpin-DNA complex (de Claire, et al . J. Am. Chem. 
Soc. 1997, 119, 7909) . Modeling indicated that replacing 
the a-H of y with an amino group that may confer an R- 
conf iguration at the a-carbon and could be accommodated 
within the floor and walls of the minor groove. 

A polyamide of Formulas I-XXIX may also be 
conjugated to a bi functional group including but not 
limited to arylboronic acid, biotins, polyhistidine of 2 
to 8 amino acids, hapten to which an antibody binds, 
solid phase support, oligodeoxynucleotide, N- 
ethylnitrosourea, fluorescein, bromoacetamide, 
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iodoacetamide, DL-a-lipoic acid, acridine, captothesin, 
pyrene, mitomycin, texas red, anthracene, anthrinilic 
acid, avidin, DAPI, isosulfan blue, malachite green, 
psoralen, ethyl red, 4- (psoraen-8-yloxy) -butyrate, 
tartaric acid, or (+) -a- tocopheral . Many other groups 
suitable for the purposes of practicing this invention 
are well known and widely available to one skilled in the 
art . 

As used herein "polyamide" refers to a polymer 
comprising the subunits listed below: 




Boc-Py-COOH Boc-im-COOH Im-COOH 



O R 3 0 R 3 O 

h * n y^oh h.n-Y^" ^rV^ 

R 3 R 3 R 3 R 3 

a-amino acid p-amino acid 7-amino acid 



where R 1 is C1-100 alkyl (preferably Ci_ 10 alkyl such as 
methyl, ethyl, isopropyl) , Cx-100 alkylamine 
(preferably C1-10 alkylamine such as ethylamine) , C1-100 
alkyldiamine (preferably C1-10 alkyldiamine such as 
N,N-dimethylpropylamine) , C1-100 alkylcarboxylate 
(preferably a Ci_ 10 alkylcarboxylate such as-CH 2 COOH) , 
Ci-xoo alkenyl (preferably Cx- 10 alkenyl such as 
CH 2 CH=CH 2 ) , Ci-100 alkynyl (preferably d- 10 alkynyl 
such as CH 2 C=CH 3 ) , or C1-100L; 

L includes but is not limited to an arylboronic acid, 
biotin, polyhistidine comprising from 2 to 8 amino 
acids, hapten to which an antibody binds, solid 
phase support, oligodeoxynucleotide, N- 

ethylnitrosourea, fluorescein, bromoacet amide, 
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iodoacetamide, DL-a-lipoic acid, acridine, 
captothesin, pyrene, mitomycin, texas red, 
anthracene, anthrinilic acid, avidin, DAPI , 
isosulfan blue, malachite green, psoralen, ethyl 
red, 4- (psoralen-8-yloxy) -butyrate, tartaric acid, 
and (+) -a-tocopheral ; 

m is an integer value ranging from 0 to 12; 

R 2 is H, NH 2 , SH, CI, Br, F, N-acetyl, or N-formyl; 

R3 is H, NH 2 , OH, SH, Br, Cl , F, OMe, CH 2 OH, CH 2 SH, or 

CH 2 NH 2 ; and, 

X is N, CH, COH, CCH 3 , CNH 2 , CC1 , or CF. 

In a preferred embodiment, R 5 and R 6 are H. 

The compounds of the present invention may comprise 
a compound of Formula XXIX or XXX: 




XXIX 

where R 1 , R a , R b , R e , R f , R 1 , R j , R n , and R° are chosen 
independently from H, Cl, NO, N-acetyl, benzyl, Ci- 6 



28 




WO 98/49142 PCT/US98/06997 

alkyl, Ci- 6 alkylamine, d-6 alkyldiamine, d-6 
alkylcarboxylate, Ci-e alkenyl, and Ci_ 6 alkynyl; 

R is selected from the group consisting of H, NH 2/ 
SH, CI, Br, F, N- acetyl, and N-formyl; 
5 R 3 , R d , R 1 and R q are selected independently from the 

group consisting of H, NH 2 , OH, SH, Br, CI, F, OMe, CH 2 OH, 
CH 2 SH, CH 2 NH 2 ; 

R 4 is -NH(CH 2 ) 0 - 6 NR 5 R 6 or NH(CH 2 ) r CO NH (CH 2 ) 0 - 6 NR s R 6 or 
NHR 5 or NH(CH 2 ) r CONHR 5 , where R 5 and R 6 are independently 
10 chosen from H, CI, NO, N-acetyl, benzyl, d-6 alkyl, d-6 

alkylamine, d-6 alkyldiamine, d- 6 alkylcarboxylate, Ci-e 
alkenyl, d-eL, where L groups are independently chosen 
from biotin, oligodeoxynucleotide, N-ethylnitrosourea, 
fluorescein, bromoacetamide, iodoacetamide, DL-a-lipoic 
.15 acid, acridine, ethyl red, 4- (psoralen- 8 -yloxy) -butyrate, 

tartaric acid, ( + ) -a-tocopheral , and d-6 alkynyl, where r 
is an integer having a value ranging from 0 to 6; 

X, X a , X b , X e , X f , X* , X j , X n , X° are chosen 
independently from the group consisting of N, CH, COH, 
20 CCH 3 , CNH 2 , CC1, CF; and 

a, b, c, d, e, f, i, j, k, and m are integers chosen 
independently, having values ranging from 0 to 5; 
or a pharmaceutical^ acceptable salt thereof. 

The invention further comprises a polyamide having 
25 the formula: 
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Jd(m) 



10 



15 



20 



XXX 

where R 1 , R aa - ra) and R h{ i* m) are chosen independently from H, 
CI, NO, N-acetyl, benzyl, d_ 6 alkyl, Ci_ 6 alkylamine, d_ 6 
alkyldiamine, C x _ 6 alkylcarboxylate, Ci_ 6 alkenyl, and Ci_ 6 
alkynyl ; 

R 2 is selected from the group consisting of H, NH 2/ 
SH, CI, Br, F, N-acetyl, and N-formyl; 

R f<m> and R c(k - m) are selected independently from the 
group consisting of H, NH 2/ OH, SH, Br, CI, F, OMe, CH 2 OH, 
CH 2 SH, CH 2 NH 2 ; 

R 4 is -NH(CH 2 )o- 6 NR 5 R 6 or NH(CH 2 ) r CO NH (CH 2 ) 0 - 6 NR 5 R 6 or 
NHR 5 or NH(CH 2 ) r CONHR 5 , where R 5 and R 6 are independently 
chosen from H, CI, NO, N-acetyl, benzyl, Ci_ 6 alkyl, Ci_ 6 
alkylamine, C^g alkyldiamine, Ci_ 6 alkylcarboxylate, Ci- 6 
alkenyl, Ci- 6 L, where L groups are independently chosen 
from biotin, oligodeoxynucleotide, N-ethylnitrosourea, 
fluorescein, bromoacetamide, iodoacetamide, DL-a-lipoic 
acid, acridine, ethyl red, 4- (psoralen- 8 -yloxy) -butyrate, 
tartaric acid, ( + ) -a-tocopheral , and C x _ 6 alkynyl, where 
r is an integer having a value ranging from 0 to 6; 
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X, X a(i ' m) and X b(j ' m) are chosen independently from the 

group consisting of N, CH, COH, CCH 3 , CNH 2/ CC1 , CF; and 

a, b, c, d, e, f, g, h, i, j , k, 1, m, n, o and p 
are integers chosen independently, having values ranging 
5 from 0 to 5; 

or a pharmaceutical ly acceptable salt thereof. 
Baird, et al . (J. Am. Chem. Soc. 118: 6141-6146) and 
PCT/US97/003332 describe methods for synthesis of 
polyamides which are suitable for preparing polyamides of 

10 this invention. Polyamides of the present invention may 

be synthesized by solid phase methods using compounds 
such as Boc -protected 3 -methoxypyrrole, imidazole, and 
pyrrole aromatic amino acids, which are cleaved from the 
support by aminolysis, deprotected with sodium 

15 thiophenoxide, and purified by reverse-phase HPLC. The 

identity and purity of the polyamides may be verified 
using any of a variety of analytical techniques available 
to one skilled in the art such as 1H-NMR, analytical 
HPLC, and/or matrix-assisted laser-desorption ionization 

20 time-of -flight mass spectrometry (MALDI-TOF MS- 

monoisotropic) . 

In addition, the above polyamide subunits can be 
synthesized in small scale by methods known in the art. 
The synthesis of Boc-Py-OBt ester 7 (Grehn, L. and 

25 Ragnarsson, U. J". Org. Chem. 1981, 46, 3492.) and Boc-Im 

acid 11 (Grehn, et al . Acta. Chim. Scand. 1990, 44, 67.) 
has been previously described. Available procedures 
provide only milligram to gram quantities of monomer ( J". 
Org. Chem. 52, 3493-3500 (1987); Bailey, et al . Org. 

30 Synth. 51, 101 (1971); Nishsiwaki, et al . Heterocycles 

27, 1945 (1988). Bailey, et al . J". Pharm. Sci . 78, 910. 
(1989)). while requiring difficult column chromatography 
and the use of toxic chlorof luorophosgene for 
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introduction of the Boc group. An optimized synthesis, 
using inexpensive starting materials, has been developed 
by the present inventor allowing Boc-Py-OBt ester and 
Boc-Im acid monomers to be prepared on 50 g scale 
without the use of column chromatography. Two dimeric 
building blocks have also been prepared, Boc-Py-Im acid 
and Boc-y-Im acid. 

A general method for preparation of these compounds 
is as follows: 




The polyamide polymer can be a homopolymer of Py and 
Im subunits or a copolymer with strategically placed 
aliphatic amino acid monomers such as a-amino acids 
(including but not limited to the naturally occurring 
amino acids and preferably being glycine) ; amino acids of 
the formula -NH- (CH) n -C0- , where n is an integer from 1-12 
(preferably n being 1 as in p-alanine or 2 as in y- 
aminobutyric acid) . 

The carboxy terminus of the polyamide may comprise - 
NH(CH 2 ) 0 -6, NR X R 2 or NH(CH 2 ) b C0 NH (CH 2 ) o-eNR^ 2 , NHR 1 or 
NH(CH 2 ) b CO NHR 1 where b is an integer from 1-6 (preferably 
1) and R 1 and R 2 are independently chosen from Ci- 6 alkyl 
(preferably Ci_ 3 alkyl such as methyl, ethyl, isopropyl) , 
Ci- 6 alkylamine (preferably Ci- 3 alkylamine such as 
ethylarnine) , Ci_ 6 alkyldiamine (preferably alkyldiamine 
such as N,N-dimethylpropylamine) , Ci-« alkylcarboxylate 
(preferably a d- 3 alkylcarboxylate such as-CH 2 COOH) , Ci- 6 
alkenyl (preferably Ci_ 3 alkenyl such as CH 2 CH=CH 2 ) , Ci-s 
alkynyl (preferably Ci- 3 alkynyl such as -CH 2 CsCH 3 ) , or a 
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C x : 6 h where L includes but is not limited to biotin, 
oligodeoxynucleotide, , N-ethylnitrosourea, fluorescein, 
bromoacetamide, iodoacetamide , DL-cc-lipoic acid, 
acridine, ethyl red, 4 - (psoraen-8-yloxy) -butyrate, 
tartaric acid, (+) -a- tocopheral . 

Most preferred compounds of the instant" invention 
are polyamides core sequence composition: ImPyPyPy-y- 
PyPyPyPy , PyPylmPy-y- PyPyPyPy , ImPyPyPy-y- ImPyPyPy , 

PylmPyPy-y-PylmPyPy, I mPylmPy-y- PyPyPyPy, ImlmPyPy-y- 
PyPyPyPy , ImlmlmPy-y- PyPyPyPy , ImlmPyPy-y- ImPyPyPy , 

I mPy Py Py - y - 1 ml mPy Py , I ml mPy Py - y - 1 m I mPy Py , I mPy I mPy - y - 
ImPylmPy, ImlmlmPy-y- ImPyPyPyPy , Imlmlmlm-y- PyPyPyPy , Im- 
P-PyPy-y-Im-(}-PyPy, Im-P-Imlm-y-Py-p-PyPy, Im-p-ImPy-y-Im- 
p - 1 mPy , ImPy Py Py Py -y - 1 mPy Py PyPy , I ml mPy Py Py - y - 1 mPy Py Py Py , 
I mPy I mPy Py - y - 1 mPy Py Py Py , I ml mPy Imlm-y- Py PyPyPyPy , 
I mPy Py I mPy - y- ImPyPylmPy, I mPy -P-PyPy-y-I mPy - P - Py Py , I ml m - 
P-Imlm-y-PyPy-p-PyPy, ImPy-p- ImPy-y- ImPy-p- ImPy ImPy-p- 
Py Py Py - y - 1 mPy Py - P - Py Py , I ml m - P - Py Py Py - y - Py Py Py - p - Py Py , 

ImPy-p-ImPyPy-y-ImPyPy-P-PyPy, Imlm-p-PyPyPy-y- ImlmPy-p- 
PyPy , ImPy - p - PyPyPy-y- Py PyPy - p - ImPy , ImPyPy PyPyPy-y- 

I mPy PyPy PyPy , I mPy Py - P - PyPy - y - 1 mPy Py - p - PyPy , I mPyPyPy - p - 
Py-y- Im- p - PyPyPyPy , ImlmPyPyPyPy -y- ImlmPyPyPyPy , Im- P - 

PyPyPyPy -y-Im-p- PyPyPyPy , I mPy Py Py -p-Py-y-I mPy Py Py - P - Py , 
ImPy ImPyPy Py-y- ImPy PyPyPyPy, ImPy Py-p-Py Py-y- ImPy-p- 

Py Py Py , I mPy Py Py Py - p - y - 1 mPy Py Py Py - p , I mPy - p - 1 mPy Py - y - 1 mPy - 
p - ImPyPy , Im-p~ PyPyPyPy-y- ImPyPyPy- P-Py , Im- P - 1 m PyPy Py-y - 
I mPy Py Py - p - Py , I mPy Py - p - Py Py Py , I ml mPy - p - Py Py Py , I ml ml m- P - 
Py Py Py , I mPyPyPyPy - p - Py Py Py , I mPy Py Py - p - Py Py Py , I mPy Py - p - 
PyPy PyPy Py , I mPyPyPy - p - PyPyPyPy , I ml mPyPy- P - PyPyPyPy , 
ImlmlmPy-p- PyPyPyPy, ImPyPyPy-p-ImPyPyPy, ImlmPyPy-p- 
ImPyPyPy , ImlmPyPyPy- p - PyPy PyPy Py , ImlmlmPyPy- p- 
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PyPyPy Py Py , I ml m - P - Py Py - p - Py Py - p - p y p y , i ml mPy - p - Py Py Py - p - 
PyPyPy , ImlmPyPy- p - Py - p - PyPyPyPy , ImPyPy-y- ImPyPy- p - 

PyPyPy, ImPyPy-y- PyPyPy- p-PyPyPy, PylmPy-y-ImPyPy-p- 

PyPyPy, PyImPy-y-ImPyPy-p-PyPyPy-0-PyPyPy, ImlmPy-y- 

ImPyPy-p- PyPyPy, ImPyPy-y- ImPyPy-G- PyPyPy , ImPyPyPy-y- 
ImlmlmPy-p-PyPyPyPy, I ml mPy Py - y - 1 ml mPy Py - P - Py PyPyPy , and 
ImlmPy Py - y - Py PyPyPy - p - Py PyPyPy . 

The compounds of the following invention may be 
synthesized by any of several well-known and widely 
available techniques. Distamycin and its analogs can be 
produced by traditional multi-step synthetic organic 
chemistry (Weiss, et al. J. Am. Chem. Soc. 1957, 79, 
1266; Arcamone, et al . Gazz. Chim. Ital . 1967, 97, 1097; 
Penco, et al Gazz. Chim. Ital. 1967, 97, 1110; Bailer, et 
al. Tetrahedron 1978, 34, 2389.) The repeating amide of 
distamycin is formed from an aromatic carboxylic acid and 
an aromatic amine, both of which have proven problematic 
for solution phase coupling reactions. The aromatic acid 
is often unstable to decarboxylation and the aromatic 
amines have been found to be air and light sensitive. 
(Lown, et al. J. Org. Chem. 1985, 50, 3774.) Variable 
coupling yields, long reaction times (often > 24 h) , 
numerous side products, and reactive intermediates (acid 
chlorides and trichloroketones) are characteristic of the 
traditional solution phase coupling reactions, (Church, 
et al. Biochemistry 1990, 29, 6827. He, et al . J. Am. 
Chem. Soc. 1993, 115, 7061.) 

The process of expanding the 2:1 polyamide-DNA motif 
to include longer sequences recognized by increasingly 
complex polyamides is demanding. For example, using 
previously described multi- step solution phase 
chemistry, the total synthesis of the hairpin polyamides 
ImPyPy-y-PyPyPy-Dp required more than a month's effort. 
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The chemical structures of the polyamide of the 
prior art ImPyPy-y-PyPyPy-Dp, and the optimized hairpin 
polyamide ImPyPy-y-PyPyPy-P-Dp provided by the instant 



invention 
P i 



are 



Q i 



IraPyPy-y-PyPyPy-Dp 
K a = 7x 10 7 M" 1 
specificty: 24-fold 



shown 



below: 



0 i 



» o o 

ImPyPy.y-PyPyPy-P-Dp 
K a =lxl0 8 M' 1 
specificty: 60-fold 



Hereinafter hairpins may be shown as chemical 
structures binding to a schematic representation of the 
minor groove. An abbreviated representation may 

10 alternatively be used wherein, imidazole rings are 

represented as filled circles, pyrrole rings are 
represented as unfilled circles, 0-alanine is represented 
as a diamond, Glycine is represented as a triangle, amide 
bonds are represented as lines, y-aminobutyric acid is 

15 represented as a curved line, and the positively charged 

dimthylaminopropylamide is represented with a ( + ) . An 
example of both notations is shown below for the 
optimized 6-ring hairpin polyamide ImPyPy-y-PyPyPy-P-Dp 
binding to a cognate 5'-TGTTA-3' site: 
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It has been shown that the Py/Py pair is 
approximately degenerate for recognition of A,T base 
5 pairs, affording generality with regard to targeting 

sequences of mixed A«T/T*A composition. (White, et al . 
Biochemistry 35, 12532-12537 (1996)). To test the extent 
of this degeneracy, the affinity of the hairpin polyamide 
ImPyPy-y-PyPyPy-p-Dp was measured for eight possible five 

10 base pair 5 » -TG (A, T) 3 -3 ' match sites. Quantitative DNase I 

footprint titration experiments reveal that ImPyPy-y- 
PyPyPy-p-Dp binds all eight 5' -TG(A,T) 3 -3' target sites 
with only a 12 -fold difference in the equilibrium 
association constants between the strongest site, 5'- 

15 TGTTT-3 1 (K* = 2.1 x 10 8 M" 1 ) and the weakest site, 5'- 

TGAAT-3' (Ka = 1.8 x 10 7 M" 1 ) (10 mM Tris*HCl, 10 mM KCl , 
10 mM MgCl 2 , 5 mM CaCl 2 , pH 7.0, 22°C) . 

Sites are recognized with decreasing affinity: 5'- 
TGTTT-3 ' > 5'-TGTTA-3' > 5'-TGTAA-3' > 5'-TGTAT-3' > 5'- 

20 TGATT-3 ' > 5'-TGATA-3' > S'-TGAAA-S' > 5'-TGAAT-3' as 
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shown 



in 



schematic 



form 



below : 




5 * -T G A T T-3' 5 ' -T G A T A-3 * 




3 * -A C T A A-5' 3 ' -A C T A T-5 * 



K a = 3.9 x 10 7 VP 1 K a = 2.5 x 10 7 M" 1 



These results indicate that all sites of the form 
5 1 -TG(A,T) 3 -3 1 are structurally compatible with polyamide- 



y-PyPyPy-P-Dp for 5 ' -TG (A, T) 3 -3 ' binding sites may be 
grouped into two sets according to sequence composition: 
5' -TGT(A,T) 2 -3' and 5 ' -TGA (A, T) 2 -3 ' . ImPyPy-y-PyPyPy-0-Dp 
binds 5' -TGT (A,T) 2 -3 ' sites with between 2- fold and 12- 
fold higher affinity than 5 ' -TGA (A, T) 2 -3 ' sites. Therefore 
binding sites containing 5'-GT-3' steps may be preferred 
over those containing 5'-GA-3' steps for therapeutic 
targets. 

These results indicate that at least a 10 -fold range 
of binding affinities and sequence specificities will be 
observed for a polyamide binding to a designated set of 
match sites containing A»T base pairs. This relatively 
small range indicates that, in contrast to the Im/Py pair 
which may distinguish G*C from OG and both of these from 
A*T/ T#A base pairs, the Py/Py pair appears not to 
distinguish A*T from T^A base pairs. The similarity of 
the polyamide binding affinities for the eight 5'- 
TG(A,T) 3 -3' match sites reflects a limit to the 
specificity of the hairpin polyamide binding motif. 
Because G*C is distinct from OG, the most specific 
recognition will be observed for G*C rich sequences. 



5 



DNA complex formation. However, the affinities of ImPyPy- 
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In principle, individual polyamide subunits can 
recognize DNA with two possible binding orientations. 
Recognition of 5 1 -TGTTA-3 1 by a polyamide of core 
sequence composition ImPyPy-y-PyPyPy places the N-terminus 

5 of each polyamide subunit at the 5' -side of each 

recognized DNA strand. Placement of the polyamide N- 
terminus at the 3' side of each recognized strand would 
result in targeting of a B'-TCTTA-S' sequence. Each 
binding orientation represents a unique and 

10 distinguishable hairpin fold. Subunit orientation 

preference was not defined by the prior art, however, in 
order to successfully apply the pairing rules towards 
polyamide design, a single predictable subunit binding 
orientation must be preferred. 

15 A schematic model of two possible hairpin polyamide 

DNA-binding orientations is shown below: 

nEEEK 5 " T G T T A " 3, 

3-A C A A T-5' 
5 'to. 3* ? N to C recognition 



!5-T G T T A- 3* 

NEaa ^^> *oooo, 

:3*-A C A A T-5' 
5 ' to 3 1 ; C to N recognition 



It has been observed that a 30 -fold (2 kcal/mol) 
20 binding-orientation preference exists for a 6 -ring 

hairpin polyamide binding with the N- terminal end of each 
subunit located towards the 5' -side of the respective 
targeted DNA strand. The pyrrole -imidazole polyamide DNA- 
binding orientation preference defines a second order 
25 design rule which must be considered for successful 

application of the pairing rules for polyamide design. 

The potential degeneracy of the Im/Py and Py/Im 
pairs for recognition of G^C and OG has not been 
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sufficiently addressed by the prior art. The exocyclic 
amine group of guanine is symmetrically placed in the 
floor of the minor groove, and will therefore be 
displayed in the same location for OG and G*C base 
5 pairs. Single mismatch binding sites described in prior 

art were exclusively G*C to A»T substitutions. It was 
unclear to those skilled in the art whether sequences 
which differ by a single G»C to OG substitution would be 
discriminated by the pyrrole- imidazole polyamide-DNA 

10 motif. The rapid design of new polyamides for 

elucidation complete pairing rules was aided by the 
discovery that the hairpin- polyamide motif is compatible 
with solid phase synthetic methods. 

A series of four polyamides were prepared: ImPyPy-y- 

15 PyPyPy-p-Dp, ImlmPy-y-PyPyPy-p-Dp, ImPyPy-y-PylmPy-p-Dp, 

and ImlmPy-y-PylmPy-p-Dp. Each polyamide places a Py/Py, 
Im/Py, Py/Im, or Im/Im pair opposite either a T/A or G/C 
base pair in eight possible ring pairing-base pair 
combinations. The structure of four hairpin polyamides, 

20 which differ in the central ring pairings, are shown in 

Figure 2 . 

It was determined that Im/Py and Py/Im pairs 
effectively discriminate G»C from OG base pairs, 
respectively and that a Im/Im pairing represents an 

25 energetically unfavored pairing. Quantitative DNasel 

footprinting experiments reveal energetics of the four 
possible pyrrole-imidazole polyamide ring pairings. Py/Py 
is found to bind preferably to A^T/T•A » G»C/C*G, Im/Py 
binds G*C » T>A/A»T >OG, and Im/Im does not bind 

30 G*C/OG or A«T/T«A. A schematic representation of the 

eight possible ring pairing-base pair interactions is 
shown below: 
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5 ' -T G 



V 

3 '-A C 



i 

G 



T A-3' 



A T-5' 



Im/Py vs. G'C 
1.0 X10 8 M" 1 



5 ' -T G 



■5 

3' -A C 



ft 




T A-3 ' 



A T-5' 



Im/Py vs. T-A 
5.0 X10 6 IvT 1 



5'-T 




A-3 



T-5 ' 



Im/lm vs. G'C 
£l.0X10 6 M- 1 



5 ' -T Gl 

fK>0 
3' -A C 



T A-3' 




Im/lm vs. T«A 
sl.OX 10 5 M" 1 



A T-5' 




A-3' 



Py/lm vs. G-C 
sl.OX 10 6 M' 1 



A T-5' 



5 ' -T G 



3 

3'-A C 



T A-3' 



Py/lm vs. T*A 
2.5 X10 6 M" 1 



A T-5 r 



5 1 -T G 



10 



15 




Py/Py vs. G»C 
s1.0X 10 6 M" 1 




T A-3 1 



Py/Py vs. T*A 
1.4 X 10 s M" 1 



A T-5' 



These results show that G*C and OG base pairs may 
be distinguished in the minor groove, while the energetic 
penalty for formation of an Im/lm pairing provides a 
basis for design of specific unlinked overlapped 
polyamide complexes as will become evident below. 

It has been determined that the 6 -ring hairpin 
polyamide motif provides a versatile template for 
recognition of a wide variety of sequences in the DNA 
minor groove. (Parks, et al . J. Am. Chem. Soc, 118, 6153 
(1996); Szewczyk, et al . Angew. Chemie, 35, 1487-1489 
(1996); Swalley, et al . J*. Am. Chem. Soc. 118, 8198- 
8206(1996)). Six-ring hairpin polyamides recognize their 
cognate sites with affinities ranging from 1 x 10 7 M" 1 to 
1 x 10 8 NT 1 and specificity against single base pair 
mismatch sites ranging from 2- fold to 60 -fold. 

A schematic of nine 6 -ring hairpin polyamides 
recognizing cognate 5 base pair sites is shown below: 
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5'-W G W W W-3' 5 ' -W G G W W-3' 5 ' -W G G G W-3' 

•CKX M<X 

tKXXXx KKXKr 

3'-W C W W W-5 1 3 f -W C C W W-5' 3 ' -W C C C W-5' 

1x10 8 M" 1 1x10 8 M" 1 1x10 7 .M" 1 

5 ' ~ W rlAA. W " 3 ' 5 ' -W W C C W-3 ' 5 ' -W GGW W-3 ' 

fH^OO^P SK>0*2^ -EK^oJo^ 

3 -W W W G W-5 3'- W W G G W-5' 3 1 -W C C W W-5' 

1x10 7 M- 1 1x10 7 M" 1 <1x10 5 M" 1 

5'-W G W C W-3 1 5'-W G G C W-3' 5 ' -W G C W W-3' 

^K>oS#^ ^K^OO^ ^OoSo^ 

3 ' -W C W G W-5' 3'-W C C G W-5' 3 1 -W C G W 

1x10 8 M" 1 1x10 8 M- 1 1x10 8 M' 1 

The broad sequence repertoire recognized by the 6- 

ring hairpin motif represents a significant advance in 

ligand design. However, no 6 -ring hairpin polyamide has 

5 been identified which recognizes a target site with 

subnanomolar affinity. 

To determine the effect of polyamide length on 

binding site size, binding affinity, and sequence 

specificity, a series of six polyamides containing three 

10 to eight rings was synthesized. (Kelly, et al . Proc. 

Natl. Acad. Sex. U.S.A. 93, 6981-6985 (1996).) The 
series is based on ImPyPy-Dp with pyrrolecarboxamide 
moieties added sequentially to the C-termini to afford 
ImPyPyPy -Dp , ImPyPyPyPy-Dp , ImPyPyPyPyPy-Dp , 

15 ImPyPyPyPyPyPy-Dp, and ImPyPyPyPyPyPyPy-Dp which are 

designed to bind 5 to 10 base pair sites, respectively as 
side-by-side antiparellel dimers. DNA binding sites are 
based on a 5'-TGACA-3' core sequence and contain 
sequential A, T inserts in the center of the binding site 

20 that will be recognized by the additional pyrrole 

carboxamides . Chemical structures of the polyamides are 
shown in Figure 3 . 
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It was determined that polyamides based on 4 or 5- 
ring subunits are optimal, and that subunits must not 
contain more than 5 consecutive rings. Binding affinity 
reaches a maximum value for the five ring polyamide 
ImPyPyPyPy-Dp and addition of up to two additional 
pyrrolecarboxamides has no effect on the observed 
association constant (Table 2) . Furthermore, sequence 
specificity decreases as the length of the polyamides 
increases beyond five rings. 
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TABLE 2 



Table 1* 



polyamide-DNA complete association constant specificity* 


5 ' -T G A C A-3' 

•ooe 
+x>o# 

3 ' -A C T G T-5' 


1.3 x 10 5 M-i 


6.5-fold 


5 ' -T G T A C A-3' 

•oooe 
+xxx># 

3 '-A C A T G T-5" ! 


8.5 x 10 6 M-i 


5.3-fold 


5'-T G T A A C A-3' 

•0000$ | 

+>oooo# 

3 1 -A C A T T G T-5' 


4.5 x 10 7 M-i 


5.7-fold 


5--TGTTAAC A-3' 

•OOOCKXj- 
tOOOOO* 

3 ' -A C A A T T G T-5' 


5.3 x 10 7 M-i 


2.7-fold 


S'-TGTTAAAC A-3" 

•oooooo-e- 
+>oooooo« 

3 ' -A C A A T T T G T-5" 


4.7 x 10 7 M-i 


2.8-fold 


5'-TGTTTAAAC A- 33 ' 
•OOOOOOOC+ 

+H>000000« 

3'-ACAAATTTG T-55 ' 


<2 x 10 7 M-i 


1-fold 



* Values reported are the rrmean values from at least thre»e footprint titration 
experiments. The assays werre performed at 22 °C pH 7.0, iin the presence of 10 mM 
TrisHCl, 10 mM KCl, lOmM MggCl 2 , and 5mM CaCl 2 . 

* Defined as the ratio of tthe match site affinity to the: affinity of the single 
base pair mismatch site. 



*Values reported are the mean values from at least three 
footprint titration experiments. The assays were 

5 performed at 22°C, pH 7.0, in the presence of 10 mM Tris- 

HCl, 10 mM KCl, 10 mM MgCl 2 and 5 mM CaCl 2 - 

These results, specifically the failure of an eight - 
ring polyamide to recognize a 10-base pair target site 
10 suggested that a new class of polyamides was needed was 

needed for extension of the 2:1 polyamide-DNA motif to 
sequences longer than 9 base pairs. The present invention 
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provides for the replacement of a central pyrrole or 
imidazole amino acid with a more flexible amino acid 
subunit, thus allowing the antiparallel dimer ' to reset 
the register for continued gain in affinity and 



To identify a flexible linker amino acid, four 
polyamides of the formula ImPyPy-X-PyPyPy-Dp where X = 
Py, G (glycine) , p, or y, respectively, were synthesized 
and their equilibrium association constants determined 
for 5 1 -TGTTAAACA-3 1 (9 base pair) sites. (Trauger, et al . 
J". Am. Chew. Soc, 118, 6160 (1996).) 

The structures of polyamides based on ImPyPy and 
PyPyPy-Dp subunits linked by pyrrole or flexible glycine 
or (J-alanine linkers are shown below: 



It was determined that p-alanine is an optimal 
linker for joining polyamide subunits in an extended 
conformation, providing a useful structural motif for the 
design of new polyamides targeted to sequences longer 
than 7 base pairs. The p-alanine-linked compound ImPyPy- 
p-PyPyPy-Dp has the highest binding affinity of the four 
polyamides, binding the 9 bp site 5 1 -TGTTAAACA-3 1 (K* = 8 



5 



specificity. 



15 
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x 10 8 M' 1 ) with affinities higher than the formally N- 
methylpyrrole- linked polyamide ImPyPy-Py-PyPyPy-Dp by a 
factor of -10 . 

Solid phase synthesis involves the stepwise assembly 
5 of a molecule while one end is covalently anchored to an 

insoluble matrix at all stages of the synthesis. 
(Merrifield, J . Am. Chem. Soc. 85, 2149-2154 (1963) ; 
Merrifield, Science 232, 341-347.) The solid phase 
approach has been successfully developed for a variety of 

10 proteins' (Gutte, et al . 246, 1922-1941 (1971)), 

oligonucleotides (Kent, S.B.H. Ann. Rev. Biochem. 57, 
957-989 (1988); Caruthers, et al. Methods In Enzymology 
154, 287-313 (1987); Caruthers, M.H. Acc . Chem. Res. 24, 
278-284 (1991)) peptoids, (Simon, et al . Proc. Natl Acad. 

15 Sci. U.S.A., 89, 9367-9371 (1992); Zuckermann, et al . J. 

Am. Chem. Soc. 114 10646-10647 (1992)), oligosacharides 
(Science 269, 202-204 (1995); Science 260, 1307-1309 
(1993)), and small non-polymeric molecules (Ellman, J. A. 
Acc. Chem. Res. 29, 132-143.) General protocols have 

20 been developed for manual and machine-assisted Boc- 

chemistry solid phase synthesis of pyrrole- imidazole 
polyamides. (Baird and Dervan, J. Am. Chem. Soc, 118, 
6141 (1996)). More specifically, the following components 
were developed: (1) a synthesis which provides large 

25 quantities of appropriately protected monomer or dimer 

building blocks in high purity, (2) optimized protocols 
for forming an amide in high yield from a support bound 
aromatic amine and an aromatic carboxylic acid, (3) 
methods for monitoring reactions on the solid support, 

30 (4) a stable resin linkage agent that can be cleaved in 

high yield upon completion of the synthesis. Solid phase 
synthesis protocols for pyrrole- imidazole polyamides 
reduce the synthetic investment from months to days. 
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A representative solid phase synthesis of a 
polyamide is shown in Figure 4. Polyamides containing 
more than 4 residues are preferably prepared' by solid 
phase methodology. For solid phase synthesis, the 
polyamide is attached to an insoluble matrix by a linkage 
which is cleaved by a single step process which 
introduces a positive charge into the polyamide. The 
addition of an aliphatic amino acid at the C-terminus of 
the pyrrole- imidazole polyamides allows the use of Boc- 
p-alanine-Pam-Resin resin which is commercially available 
in appropriate substitution levels (0.2 mmol/gram) 
(Mitchell, et al . J. Org. Chem. 1978, 43, 2845.) 
Aminolysis of the resin ester linkage provides a simple 
and efficient method for cleaving the polyamide from the 
support . 

Solid phase polyamide synthesis protocols were 
modified from the in situ neutralization Boc-chemistry 
protocols recently reported by Kent and coworkers. 
(Schnolzer, et al . Int. J. Peptide. Protein. Res. 1992, 
40, 180; Milton, et al . Science 1992, 256,1445.) Coupling 
cycles are rapid, 72 min per residue for manual synthesis 
or 180 min per residue for machine-assisted synthesis, 
and require no special precautions beyond those used for 
ordinary solid phase peptide synthesis. The manual solid 
phase protocol for synthesis of pyrrole -imidazole 
polyamides has been adapted for use on a ABI 430A peptide 
synthesizer. Stepwise cleavage of a sample of resin and 
analysis by HPLC indicates that high stepwise yields (> 
99%) are routinely achieved. 

The large number of polyamides made available by 
solid phase synthetic methodology makes possible the 
elucidation of the rules necessary for development of 
polyamides which bind DNA with subnanomolar affinities. 
Cleavage of the polyamide from the resin with a primary 
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diamine provides a polyamide having an unmodified primary 
amine group. The amine group may then be modified with 
an activated carboxylic acid or by nucleophilic aromatic 
substitution to provide a bifunctional polyamide. 
5 Standard techniques available to one skilled in the 

art may be used to determine the DNA binding properties 
of novel pyrrole- imidazole polyamides . Affinity cleaving 
titration experiments ((25 mM Tris-Acetate , 20 mM NaCl , 
100 mM bp calf thymus DNA, pH 7, 22°C, 10 mM DTT ; 10 mM 

10 Fe(II)) using polyamides modified with EDTA»Fe ( II ) at the 

C- terminus are used to determine oriented binding. 
MPE*Fe(II) footprinting experiments (Hertzberg and 
Dervan, J. Am. Chem. Soc. , 104, 313 (1982); Van Dyke and 
Dervan, Biochemistry, 22, 2373 (1983); Van Dyke and 

15 Dervan, Nucleic Acids Res., 11, 5555 (1983); Hertzberg 

and Dervan, Biochemistry, 23, 3934 (1984)) (25 mM Tris- 
acetate, 10 mM NaCl, 100 /xM calf thymus DNA, 5 mM DTT, pH 
7.0 and 22 °C) are used to determine binding site size. 
Quantitative DNasel footprinting (Brenowitz, et al . 

20 (1986) . Methods Enzymol . 130, 132-181.; Fox and Waring 

(1984). Nucleic Acids Res. 12, 9271-9285 Brenowitz, M. , 
Senear, D.F., Shea, M.A. & Ackers, G.K. (1986); Proc. 
Natl. Acad. Sci. U.S.A. 83, 8462-8466.) (10 mM Tris-HCl, 
10 mM KC1, 10 mM MgCl 2 , and 5mM CaCl 2 , pH 7.0, 22 °C) 

25 reveals the equilibrium association constants for binding 

to match and mismatch sites. All footprinting experiments 
are performed on 3' and 5' 32 P end restriction fragments 
derived from plasmids. 3 '-shifted cleavage patterns are 
consistent with location of the polyamide in the minor 

30 groove . 

Tert-butoxycarbonylaminoacyl-4- (oxymethyl) -phenyl - 
acetamidomethyl -resin (PAM resin) is commercially 
available and cleaved in high yield by aminolysis with 
primary amines. (Mitchell, A.R. ; Kent, S.B.H., Engelhard, 
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M. Merifield, R.B., J . Org. Che/n. 43, 2845.) Insertion 
of a C-terminal aliphatic amino acid residue makes the 
hairpin- polyamide motif compatible with solid phase 
synthetic methods, allowing the rapid design of new 
polyamides. This result sets the stage for the 
elucidation of the limits of hairpin motif with regards 
to binding site size, binding affinity, and sequence 
specificity. 

A schematic representation of the recognition of a 
nine base pair target site, by a polyamide containing a 
P- spring is shown below: 



p - spring 




^ <C=3 0 - spring 



2(lmPyPy-p-PyPyPy-Dp) 
5'-TGTTAAACA-3' 

K a = 8 x 10 8 M- fl 



The binding data for ImPyPy-y-PyPyPy-Dp, which was 
shown previously to bind DNA in a "hairpin" conformation, 
indicates that y-aminobutyric acid does not effectively 
link polyamide subunits in an extended conformation. The 
discovery of P-alanine as an effective linker for joining 
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polyamide subunits in an extended conformation, provides 
a useful structural motif for the design of new 
polyamides based on subunits < 5 -rings targeted to sites 
longer than 7 bp. 

5 At least two distinct binding modes are expected to 

form for the ImPyPy-X-PyPyPy-Dp polyamides described 
above that bind in an extended conformation. These 
binding modes as "slipped" and "overlapped" . In the 
overlapped (9 base pair) binding mode, two ImPyPy-X- 

10 PyPyPy-Dp polyamides bind directly opposite one another. 

The "slipped" (13 base pair) binding mode integrates the 
2:1 and 1:1 polyamide-DNA binding motifs at a single 
site. In this binding mode, the ImPyPy moieties of two 
ImPyPy-X-PyPyPy-Dp polyamides bind the central 5'-AGACA- 

15 3' sequence in a 2:1 manner as in the ImPyPy homodimer, 

and the PyPyPy moieties of the polyamides bind to A, T 
flanking sequences as in the 1:1 complexes of distamycin. 

A schematic model of the "slipped" and "overlapped" 
binding modes is shown below. 



20 



25 



30 



5»- T GTTAAAC A-3 ' 5 ' -A AAAAGAC AAAA A-3 ' 

•OOoOOO tOOvOOO 

a ^oock>oo# <xxxkx>« 

3-ACAATTTG T-5 ' 3*-TTTTTCAGTTTT T-5 ' 
"overlapped" binding mode "slipped" binding mode 



The present invention provides P-alanine as an 
optimal linker for joining polyamide subunits in a 
"slipped" extended conformation, providing a structural 
motif whereby a MW « 900 polyamide recognizes a 13 base- 
pair DNA sequence. The p-alanine-linked compound ImPyPy - 
P-PyPyPy-Dp binds to a 13 bp 5 « -AAAAAGACAAAAA- 3 ' site 
with an association constant. Ka = 5 x 10 9 M" 1 , that is 
higher than the formally tf-methylpyrrole- linked polyamide 
ImPyPy- Py-PyPyPy-Dp by a factor of -85. 
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-As described above, y-aminobutyric acid, and preferably 
P-alanine, effectively link polyamides in hairpin and 
extended conformations, respectively. It has also been 
demonstrated that y-aminobutyric does not optimally link 
polyamide subunits in extended conformations, and that P- 
alanine does not optimally link polyamide subunits in 
hairpin conformations. These results suggested that y- 
aminobutyric acid and p- alanine could be combined within 
a single polyamide with predictable results. (Trauger, et 
al, Chem. &Biol. t 3, 369 (1996)). 

It has been determined that the nine -ring "extended 
hairpin" polyamide ImPyPy-y-ImPyPy-p-PyPyPy-G-Dp binds a 
9-bp target site 5 ■ -AAAAAGACA-3 ' at 0.05 nM 
concentration, an increase in affinity relative to the 
six-ring hairpin polyamide ImPyPy-y-lmPyPy-p-Dp of -400- 
fold. These results provide a strategy for increasing 
the DNA-binding affinity of hairpin polyamides into the 
subnanomolar range. Furthermore, as will become evident 
below, many important DNA binding transcription factors 
such as TBP and homeodomain proteins have A, T rich 
consensus sequences. Extended hairpin polyamides provide 
a general method by which a polyamide may interfere with 
protein-DNA interactions by recognizing a unique sequence 
adjacent to certain protein binding sites. A schematic 
binding model of extended hairpin polyamide recognition 
of a 9 base pair sequence is shown below: 
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5'-AAAAAGAC A-3 ' 
•OCK 

+XK>0<X>00# ; 

3»-TTTTTCTG T-5 ' 




ImPyPy-Y-lmPyPy-p-PyPyPy-G-Dp 

K a = 2xl0 10 M' 1 

Provided herein are extended hairpin polyamide 
motifs that provide versatile templates for recognition 
of a wide variety of sequences in the DNA minor groove. 
Extended hairpin polyamides recognize their 9 to 13 base 
pair sites target site with affinities ranging from 1 x 
10 8 M" 1 to >5 x 10 10 M" 1 and specificity against single base 
pair mismatch sites ranging from 5- fold to 60 -fold. A 
schematic of nine extended hairpin polyamides containing 
9 to 12 rings and recognizing 9 to 13 base pair target 
sites is shown in Figure 5. 

Provided herein is an endonuclease protection assay 
to measure the rate of polyamide-DNA complex formation. 
Such an assay may comprise a labeled restriction fragment 
comprising a polyamide binding site . that overlaps a 
restriction endonuclease cleavage site. Cleavage by the 
cognate is prevented when the overlapping polyamide 
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binding site is occupied by the polyamide. As a control, 
a second labeled DNA fragment may be that contains the 
restriction site, but lacks the overlapping ' polyamide 
binding site. The rate of polyamide association with its 
target binding site may be assessed by incubating the 
solutions of the polyamide with the labeled target and 
reference fragments for a sufficient timer period. Using 
the experimental conditions provided herein, the 
reference site is nearly completely digested, but 
protection at the target site is observed and can be 
correlated with polyamide concentration and the time of 
equilibration. Similarly, the dissociation rate is 
analyzed by adding an excess of unlabeled competitor DNA 
to an equilibrated solution of the labeled DNA fragments 
and polyamide. Addition of the competitor reduces the 
concentration of free polyamide to zero. The rate at 
with polyamide dissociation occurs from the target site 
on the labeled fragment can be followed by the rate of 
loss of protection from restriction enzyme digestion as 
the re-equilibration time is increased. 

First generation six-ring hairpin polyamides bind 
DNA with association constants of approximately 1 x 10 8 
NT 1 (Figure 6) The observation that unlinked four-ring 
polyamides form 2:1 complexes with 70-fold-higher 
affinity relative to three-ring polyamides suggested an 
eight-ring hairpin polyamide motif for recognition of 
DNA at subnanomolar concentration. The present inventor 
has shown that two eight -ring pyrrole -imidazole 
polyamides differing in sequence by a single amino acid 
bind specifically to respective six base pair target 
sites which differ in sequence by a single base pair. 
(Trauger, et al . Nature, 382, 559-561 (1996)). Binding 
is observed at subnanomolar concentrations of ligand. 
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DNA-binding affinities were determined for two 
eight -ring hairpin polyamides , ImPyPyPy-y- ImPyPyPy-p-Dp 
and ImPyPyPy-y-PyPyPyPy-p-Dp, which differ by a single 
amino acid, for two 6 base pair (bp) target sites, 5'- 
AGTACT-3' and 5 ' -AGTATT-3 1 , which differ by a single base 
pair. Based on the pairing rules for polyamide-DNA 
complexes , the sites 5 ' -AGTACA-3 ' and 5 » -AGTATT-3 ■ are 
for ImPyPyPy-y-lmPyPyPy-p-Dp "match" and "single base pair 
mismatch" sites, respectively, and for polyamide 
ImPyPyPy-y-PyPyPyPy-p-Dp "single base pair mismatch" and 
"match" sites, respectively. Binding models for 5»- 
AGTACT-3 ' and 5 ' -AGTATT-3 ' in complex with ImPyPyPy-y- 
ImPyPyPy-p-Dp and ImPyPyPy-y-PyPyPyPy-p-Dp are shown in 
Figure 7. 

I mPy Py Py -y - 1 mPy Py Py - p - Dp and I mPy Py Py - y - Py Py Py Py - p - Dp 
were synthesized by solid phase methods and purified by 
reversed phase HPLC. Equilibrium association constants 
for match and mismatch six base pair binding sites on a 
32 

3'- P-labeled 229 bp restriction fragment were 
determined by quantitative DNase I footprint titration 
experiments. ImPyPyPy-y- ImPyPyPy-p-Dp binds its match 
site 5 , -AGTACT-3' at 0.03 nM concentration and its single 
base pair mismatch site 5 1 -AGTATT-3 1 with nearly 100-fold 
lower affinity. ImPyPyPy-y-PyPyPyPy-p-Dp binds its 
designated match site 5 1 - AGTATT-3 1 at 0.3 nM 
concentration and its single base pair mismatch site 5 1 - 
AGTACT-3 1 with nearly 10-fold lower affinity. The 
specificity of ImPyPyPy-y- ImPyPyPy-p-Dp and ImPyPyPy-y- 
PyPyPyPy-p-Dp for their respective match sites results 
from very small structural changes. Replacing a single 
nitrogen atom in ImPyPyPy-y-ImPyPyPy-p-Dp with C-H reduces 
the affinity of the polyamide* 5 1 -AGTACT-3 ' complex by 
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-7-5-fold representing a free energy difference of -2.5 



polyamide^S'-AGTATT^' complex -10-fold, a loss 



These results show that using a simple molecular 
shape and a two letter aromatic amino acid code, pyrrole- 
imidazole polyamides can achieve affinities and 
specificities comparable to DNA-binding proteins. It 
remained to be determined if additional motifs could be 
discovered to provide polyamides with subnanomolar 
binding affinities. 

It has been suggested that pyrrole-imidazole 
polyamides would bind G/C rich sequences with low binding 
affinity due to steric hindrance with the exocyclic 
amines of the guanine bases. It has also been noted that 
the lower negative electrostatic potential of a G/C rich 
minor groove relative to an A, T rich minor groove might 
prohibit high affinity binding. (Pullman, et al . 
Quarterly Reviews of Biophysics. (1981) 14, 289-380; 
Pullman, B. Advances in Drug Research. (1989) 18, 1- 
113. Manning, G.S. Q. Rev. of Biophysics. (1978) 11, 
179-246; Honig and Nicholls. Science (1995) 268, 1144.) 
It has been found that an 8 -ring hairpin polyamide can 
recognize a G/C rich target sequence with subnanomolar 
affinity. 

Schematic binding models of eight -ring hairpin 
polyamides designed for recognition of 5'- 
(A/T) (G/C) 4 (A/T) -3 ' sequences. 
5 f -W G G C C W-3' 5 * -W G C G C W-3 1 5 ' -W G G G G W-3 ' 



kcal/mole. Similarly, replacing a C-H in ImPyPyPy-y- 
PyPyPyPy-P-Dp with N reduces the affinity of the 



5 



binding energy of -1.3 kcal/mol. 






30 



3'-W C C G G W-5* 
K a =1x10 1 <>M- 1 



3 f -W C G C G W-5' 
K a = 4x10 7 M* 1 



3'-W C C C C W-5' 
K a = 3x10 7 M" 1 
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To examine whether a core sequence of purely G,C 
base pairs could be recognized with high affinity and 
specificity, three eight-ring hairpin polyamides which 
differ only by the arrangement of pyrrole and imidazole 

5 amino acids, ImlmPyPy-y- ImlmPyPy-p-Dp, ImPylmPy-y- 

ImPylmPy-P-Dp, and Imlmlmlm-y-PyPyPyPy-p-Dp were designed 
for recognition of three core sequences consisting of 
solely G,C base pairs. DNase I footprint titrations 
allow the determination of equilibrium association 

10 constants (K a ) for each polyamide. ImlmPyPy-y- ImlmPyPy-P- 

Dp binds the match site 5'-TGGCCA-3' with an equilibrium 
association constant of Ka = 1 x 10 10 M" 1 (10 mM Tris^HCl, 
10 mM KC1, 10 mM MgCl 2 and 5 mM CaCl 2/ pH 7.0 and 22°C) . 
The two designed double base pair mismatch sequences, 5'- 

15 TGCGCA-3' and 5 ' -TGGGGA-3 ' , are bound with at least 200- 

fold reduced affinity. ImPylmPy-y-ImPylmPy-P-Dp binds the 
site 5'-TGCGCA-3' with a K a = 4 x 10 7 M" 1 with 4-fold 
specificity, and Imlmlmlm-y-PyPyPyPy-p-Dp binds the site 
5 ' - TGGGGA-3 " with a Ka = 3 x 10 7 M" 1 with 6-fold 

20 specificity. 

These results indicate that the positioning of the 
Im amino acids have a profound effect on the binding 
affinities of pyrrole- imidazole polyamides. More 
specifically these results indicate that binding affinity 

25 could be restored by the design of hairpin polyamides 

where a pyrrole ring has been substituted by more 
flexible spacer amino acid such as P-alanine. 

It has been found that replacement of a pyrrole 
residue with a p-alanine spacer residue in each subunit 

30 of ImPylmPy-y-ImPylmPy-P-Dp provides an eight residue 

hairpin polyamide, Im-p- ImPy-y- Im-P-ImPy-p-Dp, which 
recognizes 5'-TGCGCA-3' sequences with subnanomolar 
affinities. 
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Structures and schematic binding models for the 
eight ring hairpin polyamide ImPylmPy-y-ImPylmPy-P-Dp and 
the eight residue hairpin polyamide Im-p-ImPy-y-Im-p-ImPy- 
p-Dp are shown in Figure 8. 

5 It has been found that the four ring hairpin 

polyamide motif provides a versatile template for 
recognition of a wide variety of sequences in the DNA 
minor groove. Eight ring and residue hairpin polyamides 
recognize 6 base pair target sites with affinities 

10 ranging from 1 x 10 7 M" 1 to >1 x 10 10 M" 1 and specificity 

against single base pair mismatch sites ranging from 2- 
fold to > 100-fold. A schematic of fifteen 8-residue 
hairpin polyamides recognizing 6 base pair target sites 
is shown in Figure 9. 

15 First generation fully overlapped p-linked 

polyamides based on three ring subunits bind DNA with 
association constants of approximately 8 x 10 8 M" 1 . The 
observation that unlinked four-ring polyamides form 2:1 
complexes with 70 -fold-higher affinity relative to three- 

20 ring polyamides suggested a fully overlapped 8-ring 4-P-4 

polyamide motif for recognition of 11 base pairs of DNA 
at subnanomolar concentration. The chemical structures of 
three 4-p-4 polyamides are shown in Figure 10. 

It has been found that three eight ring 4-P-4 

25 pyrrol e - imida zole polyamide , ImlmlmPy - P - PyPyPyPy - P - Dp , 

I ml mPy Py - P - Py Py Py Py ~P~Dp and I mPy Py Py - P - Py Py Py Py - p - Dp 
specifically recognize targeted 5'-AGGGATTCCCT-3' , 5*- 
AGGTATTATCCT - 3 " and 5 ' - AGTAATTTACT - 3 ' sites, 

respectively. DNase I footprint titrations reveal that 

30 each polyamide binds its respective target site at 

subnanomolar concentrations with equilibrium association 
constants ranging from = 7 x 10* M" 1 to 5 x 10 10 M" 
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x , • and with 7 to 30-fold specificity over double base 
pair mismatch sites. 

The ability of 3-0-3 and 4-p-4 polyamides to 
recognize both "slipped" and "overlapped" complexes for 
5 recognition of two separate classes of target sites 

represents a limit to the sequence specificity of the (}- 
extended polyamide motif. The discovery that a Im/Im 
polyamide pairing is disfavored, suggests that the 4-p-4 
polyamide ImlmlmPy-p-PyPyPyPy-p-Dp should bind 

10 preferentially in the fully overlapped polyamide motif. A 

schematic representation of the recognition of three 
targeted DNA sites by three 4-p-4 polyamides is shown in 
Figure 11. 

The 4-y-4 polyamide ImlmlmPy-y-PyPyPyPy-p-Dp binds a 

15 5 ' -AGGGAA-3 ' target site in a hairpin conformation with 

an association constant of Ka * 4 x 10 8 . The 4-y-4 
polyamide ImlmlmPy-y-PyPyPyPy-p-Dp is related to the 4-p-4 
polyamide ImlmlmPy-p-PyPyPyPy-p-Dp by deletion of a 
single methylene unit (MW = 14) from the linker region. 

20 The y and P linkers specificity turn and extended binding 

respectively and enlarge targeted binding site size from 
6 to 11 base pairs, resulting in a 2.1 kcal/mol 
enhancement in binding energy. These results, the 
specific recognition of a G,C-rich 11 base pair sequence, 

25 represent a significant advance in the development of 

general DNA-binding that can recognize a single site in 
the human genome. 

It has been determined that there exists at least a 
20-fold preference for placement of a p/p pair opposite 

30 an A*T or T*A base pair relative to a G^C or OG base 

pair. Quantitative DNase I footprint titration 

experiments reveal that ImlmlmPy-p-PyPyPyPy-p-Dp binds 
the designed match site 5'-AGGGAATCCCT-3 ' with an 
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equilibrium association constant of Ka 1.4 x 10 10 M* 1 and 
the single base pair P/P mismatch sequence 5'- 
AGGGAGTCCCT-3 ' with at least 20-fold lower affinity (K a = 
6.9 x 10 8 M" 1 ) . These results implicate the p /p 

5 combination as both a flexible spacer unit and a 

sequence-specific DNA binding element. The specificity of 
the P/P pairing reveals an additional pairing rule 
pivotal to the design of polyamides for recognition of 
longer binding sites. A schematic model of placement of 

10 the P/P pair opposite G,C or A, T base pairs is shown in 

Figure 12 . 

It has been found that the extended, fully 
overlapped polyamide-DNA motif, provides a versatile 
template for recognition of symmetric sequences 

15 containing from 9 to 13 base pairs in the minor groove. 

Equilibrium association constants for cooperative complex 
formation range from Ka = 1 x 10 7 M" 1 to K a > 1 x 10 11 M" 1 . 
Specificities have been found to range from 2- fold to 
>20-fold for discrimination of single base pair mismatch 

20 sites. A schematic representation of several p- linked 

fully overlapped polyamide complexes is shown in Figure 
13 . 

To further expand the targetable binding site size 
and sequence repertoire available to the hairpin 

25 polyamide motif, two polyamides containing either two or 

three Im amino acid residues, ImPyPyPyPy-y-ImPyPyPyPy-P-Dp 
and ImlmPyPyPy-y-ImPyPyPyPy-P-Dp, were prepared by solid 
phase synthetic methodology and their DNA binding 
properties analyzed. The structures of two 10-ring 

30 hairpin polyamides are shown in Figure 14 . 

It has been shown that that ImPyPyPyPy-y-ImPyPyPyPy- 
P-Dp binds the formal 7 bp match sequence 5 ' -TGTAACA- 3 ' 
with an equilibrium association constant (Ka) of Ka 1-2 
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x 10 10 M" 1 and the single base pair mismatch sequence 5'- 
TGGACA- 3 ' with K a = 6.8 x 10 8 M' x . (10 mM Tris*HCl, 10 mM 
KC1, 10 mM MgCl 2 , 5 mM CaCl 2/ pH 7.0, 22 °C) . 
ImlmPyPyPy-y-ImPyPyPyPy-p-Dp, which differs from 

5 ImPyPyPyPy-y-ImPyPyPyPy-p-Dp by a single amino acid 

substitution binds its formal match sequence 5 ' -TGGAACA- 
3' with an equilibrium association constant of K a = 3 . 6 x 
10 9 M" 1 and its corresponding single base pair mismatch 
sequence 5 ' -TGTAACA-3 ' with Ka < 1 x 10 7 M" 1 . The 

10 replacement of a single electron lone-pair with a 

hydrogen atom within a -1500 MW polyamide is found to 
modulate affinity and specificity by more than an order 
of magnitude. Sequence-specific recognition of a 7 bp 
target site by a ten-ring hairpin polyamide at 

15 subnanomolar concentration expands the effective 

targetable sequence repertoire of the pyrrole- imidazole 
polyamide-DNA motif. 

A schematic model of two 10 -ring hairpin polyamides 
recognizing .match and mismatch 7 base pair sequences is 

20 shown below: 

5'-T G T A A C A-3' 5'-T g|a A C A-3 

•OOOCX 




3 ' -A C A T T 6 T-5' 3 ' -A C A T T G T-5 ' 

K a = lxl0 10 K a <lxl0 7 

5'-T gBIaAC A-3' 5'-T G G A A C A-3' 
3' -A C C T T G T-5' 3 ' -A C C I T G T-5' 

K.ZZ7X10 8 K a = 4xl0 9 

The specificity of that ImPyPyPyPy-y-lmPyPyPyPy-p-Dp 
and that ImlmPyPyPy-y-ImPyPyPyPy-P-Dp for their respective 
match sites results from very small structural changes. 
25 Replacing a single C-H in that ImPyPyPyPy-y- ImPyPyPyPy-p- 
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Dp with a nitrogen atom as in that ImlmPyPyPy-y- 
ImPyPyPyPy-p-Dp reduces the affinity of the ImlmPyPyPy-y- 
I mPy Py PyPy - f$ - Dp • 5 ' - TGTAACA - 3 ' complex relative to the 
ImPyPyPyPy-y-ImPyPyPyPy-p-Dp»5' -TGTAACA- 3 ' complex by > 

5 300-fold, a free energy difference of at least 4 

kcal/mol. Similarly, replacing a N in that ImlmPyPyPy-y- 
ImPyPyPyPy-p-Dp with a C-H as in that ImPyPyPyPy-y- 
ImPyPyPyPy-P-Dp, reduces the affinity of the ImPyPyPyPy-y- 
ImPyPyPyPy-p-Dp«5' -TGGAACA-3 ' complex relative to the 

10 ImlmPyPyPy-y-ImPyPyPyPy-p-Dp^B' -TGGAACA-3 ' complex by a 

factor of 5- fold, a loss in binding energy of ~1 
kcal/mol. The reduced overall specificity and binding 
affinity of that ImlmPyPyPy-y-ImPyPyPyPy-p-Dp relative to 
that ImPyPyPyPy-y-ImPyPyPyPy-p-Dp most likely results 

15 from the presence of a 5'-GA-3' step in the designated 

target site. 

A polyamide, ImPy-p-ImPy-y-ImPy-p-ImPy-p-Dp, based on 
P-alanine linked 2-ring subunits was prepared to target a 
seven basepair region adjacent to a binding site for the 

20 transcription factor TBP in a conserved HIV gene-promoter 

sequence. The polyamide was designed based on the 
pairing rules described herein, and was found to 
recognize its designated 5 ' -TGCTGCA-3 ' target sequence 
with a binding affinity of Ka = 3 . 6 x 10 9 M' 1 . An isomeric 

25 mismatched polyamide, ImPy-p-ImPy-y-ImPy-p-ImPy-p-Dp, 

which differs only by the position of the Py and Im amino 
acids within the 2-P-2 -y-2-P~2 molecular template binds 
the targeted 5 -TGCTGCA-3 ' sequence with 100-fold reduced 
affinity. A schematic representation of a polyamide and a 

30 control polyamide which are molecular isomers, yet 

discriminate a 7 -base pair sequence of an HIV gene 
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promoter with a 100-fold specificity is shown in Figure 
15. 

These results reveal that hairpin polyamides based 
on 5-ring subunits provide a useful structural motif for 
5 the recognition of 7 bp binding sites at subnanomolar 

concentrations. For targeting 5' -WGWWWCW-3 ' sequences a 
5-y-5 polyamide, ImPyPyPyPy-y-ImPyPyPyPy-p-Dp based on 2 
y-aminobutyric acid linked 5-ring subunits is preferred 
over the corresponding p-substituted, 2-P-2-y-2-p-2 

10 polyamide ImPy-p-PyPy-y- imPy-p-PyPy-p-Dp . For targeting 

5' -WGCWGCW-3' and 5 ' - WGGWGGW - 3 ' sequences the respective 
P-substituted, 2 -p-2 -y-2-p~2 polyamides ImPy-P-ImPy-y- 
ImPy-P-ImPy-p-Dp and Imlm-P-Imlm-y-PyPy-P-PyPy-p-Dp are 
preferred over the respective 5-y-5 polyamides, 

15 ImPyPylmPy-y-ImPyPylmPy-p-Dp and ImlmPylmlm-y-PyPyPyPyPy- 

P-Dp based on y-aminobutyric acid linked 5-ring subunits. 
A series of hairpin polyamides which recognize 7 base 
pair target sites are shown in Figure 16. 

The present inventor has discovered that a p/p 

20 pairing is preferred to a Py/p pairing for extension of 

the targetable binding site size of the hairpin polyamide 
motif. Three "12-ring hairpin" polyamides, ImPyPyPyPyPy- 
Y-ImPyPyPyPyPy-p-Dp, ImPyPy-P-PyPy-y-ImPyPy-p-PyPy-p-Dp 
and ImPy-p-PyPyPy-y-ImPyPy-p-PyPy-p-Dp were synthesized by 

25 solid phase synthetic methodology. 

TABLE 3 
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Mismatch 



Mismatch 



Specificity 



S'-TGTTAAC A-3 ' 



3'-ACAATTG T-5 ' 

K a = 2.5xl0 9 



5 1 -T G TBA A C A- 3 1 




3'-ACACTTG T-5' 

K a = 3.9x 10 8 



6-fold 



S'-TGTTAAC A-3' 



5 ' -T G TBA A C A-3' 



3*-ACAATTG T-5' 

K a = 2.9 x 10 9 




6-fold 



K a = 4.5x 10' 



5 1 -T G T T A A C A-3' 



3'-ACAATTG T-5' 

K a =1.2 xlO 11 




55-fold 



K a = 2.2 x 10 



DNase I footprint titrations reveal that the hairpin 
polyamide based on 6-ring subunits, ImPyPyPyPyPy-y- 
ImPyPyPyPyPy-p-Dp, binds the formal 8 bp match sequence 
5 ' -TGTTAACA- 3 ' with an equilibrium association constant 
(Ka) of Ka a 4 x 10 9 M" 1 and the single base pair mismatch 
sequence 5 ' - TGTGAACA - 3 ' with K a = 2 x 10 8 NT 1 . ImPyPy-p- 
PyPy-y-ImPyPy-p-PyPy-p-Dp which differs from ImPyPyPyPyPy- 
y-ImPyPyPyPyPy-p-Dp by substitution of two flexible 
aliphatic amino acid residues for two pyrrole rings, 
binds a 5 ' -TGTTAACA - 3 ' match site Ka = 2 x 10 10 M" 1 and a 
5' -TGTGAACA- 3' mismatch with K a = 1 x 10 9 M" 1 . ImPy-p- 
PyPyPy-y-ImPyPy-p-PyPy-p-Dp binds a 5 ' -TGTTAACA- 3 1 match 
site with an equilibrium association constant of Ka 1 x 
10 11 and a single base pair mismatch sequence 5 1 - TGTGAACA - 
3' with K a < 1 x io 9 . (Table II). These results expand the 
targetable binding site size accessible to the hairpin 
polyamide motif to 8 base pairs. 
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■ P/P pairing within the hairpin polyamide motif as 
shown below completely abolishes DNA-binding: 

5'-TGTTAAC A-3' 



The present inventor has found that a paired p/p 
substituted hairpin motif allows specific targeting of 
sequences of the form 5 ' -WGWGWWCW-3 ' . Substitution of a 
P/P pair for the second pyrrole pairing of a 12 -ring 
hairpin polyamide, provides polyamides which target a 
wide variety of 8 base pair sequences of mixed sequence 
composition. Sequences are bound with subnanomolar 
affinity and 50-100 fold specificity versus single base 
pair mismatch sites as shown in Table 4. 




3 ' -A C A A T T G T-5 * 

No Specific Binding 



TABLE 4 
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Match Mismatch Specificity 


5'-TGTTAAC A-3 ' 

•oooocx 

3' -AC AAT T G T-5 ' 
K a = 3 X 10 9 


5'-T G Tl 
3' -A C A 

K a = < 


■ A A C A-3' 
C T T G T-5' 

IxlO 8 


8-fold 


5'-TGTGAAC A-3' 

#0#OOCK 

K>00OO0# J 

3'-ACAATTG T-5" 
K a = 3 X 10 9 


5'-T G Tj 

3 '-A C A 
K a = 


■ A A C A-3' 
C T T G T-5' 

4x 10 8 


8-fold 


5'-T G T TAAC A-3" 

toOO(XX 

dOOoOOO^ 

3 ' -A C A A T T G T-5' 
K a = 5 X 10 9 


5 ' ^#^B^OO^~ 3 

3'-ACACTTG T-5 ' 

K a < 1 x 10 8 


>50-fold 


5'-TGTGAAC A-3 

^OOOOOO^ 

3'-ACACTTG T-5 
K a = 1 X 10 10 


5 ■ ^♦o^BoOO-^ -3 

3 ' -A C A A T T G T-5 ' 
K a <lxl0 8 


>100-fold 



The following examples illustrate particular 
embodiments of the present invention and are not limiting 
of the specification and claims in any way. 

5 
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EXAMPLES 
Synthesis of Polyamides 

A. Materials 

Boc-p-alanine- ( -4 -carboxamidomethyl) -benzyl-ester- 
5 copoly (styrene-divinylbenzene) resin (Boc-P-Pam-Resin) , 
dicyclohexylcarbodiimide (DCC) , hydroxybenzotriazole 

(HOBt) , 2- (IH-benzotriazole-l-yl) -1,1,3,3- 

tetramethyluronium hexa-f luorophosphate (HBTU) , Boc- 
glycine, and Boc-p-alanine were purchased from Peptides 

10 International. 2tf,iV-diisopropylethylamine (DIEA) , N,N- 
dimethylformamide (DMF) , N-methylpyrrolidone (NMP) , and 
DMSO/NMP were purchased from Applied Biosystems . Boc-y- 
aminobutyric acid was from NOVA Biochem, dichloromethane 
(DCM) and triethylamine (TEA) was reagent grade from EM, 

15 thiophenol (PhSH) , dimethyl ami nopropyl amine, 

trichloroacetyl chloride, N-methylpyrrole, and N- 
methylimidazole from Aldrich, and trif luoroacetic acid 
(TFA) from Halocarbon. All reagents were used without 
further purification. 

20 X H NMR were recorded on a GE 300 instrument operating 

at 300 MHz. Chemical shifts are reported in ppm relative 
to the solvent residual signal. UV spectra were measured on 
a Hewlett-Packard Model 8452A diode array 

spectrophotometer. IR spectra were recorded on a Perkin- 

25 Elmer FTIR spectrometer. High- resolution FAB mass spectra 
were recorded at the Mass Spectroscopy Laboratory at the 
University of California, Riverside. Matrix- assisted, 
laser desorption/ionization time of flight mass 
spectrometry was carried out at the Protein and Peptide 

30 Microanalytical Facility at the California Institute of 
Technology. HPLC analysis was performed either on a HP 
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10.90M analytical HPLC or a Beckman Gold system using a 
RAINEN C 18 , Microsorb MV, 5/zm, 300 x 4 . 6 mm reversed phase 
column in 0.1% (wt/v) TFA with acetonitrile as eluent and a 
flow rate of l.o mL/min, gradient elution 1.25% 
acetonitrile/min. Preparatory HPLC was carried out on a 
Beckman HPLC using a Waters DeltaPak 25 x 100 mm , 100/xm C 18 
column equipped with a guard, 0.1% (wt/v) TFA, 0.25% 
acetonitrile/min. 18MQ water was obtained from a Millipore 
MilliQ water purification system, and all buffers were 
0.2jtm filtered. Thin- layer chromatography (TLC) was 
performed on silica gel 60 F 2S 4 precoated plates. Reagent- 
grade chemicals were used unless otherwise stated. 



15 B. SYNTHESIS OF BOC-PROTECTED PYRROLE AND IMIDAZOLE 
MONOMER 

1. 4 -Nitro- 2- trichloroacetyl -1-methylpyrrole: 
0 2 N s 

// W 



20 



25 




I 6 

To a well stirred solution of trichloroacetyl chloride 
(1 kg, 5.5 mole) in 1.5 liter ethyl ether in a 12 liter 
flask was added dropwise over a period of 3 h a solution of 
W-methylpyrrole (0.45 kg, 5.5 mole) in 1.5 liter anhydrous 
ethyl ether. The reaction was stirred for an additional 3 
hours and quenched by the dropwise addition of a solution 
of 400 g potassium carbonate in 1.5 liters water. The 
layers were separated and the ether layer concentrated in 
vacuo to provide 2- (trichloroacetyl) pyrrole (1.2 kg, 5.1 
mol) as a yellow crystalline solid sufficiently pure to be 
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used without further purification. To a cooled (-40°C) 
solution of 2- (trichloroacetyl) pyrrole (1.2 kg, 5.1 tnol) 
in acetic anhydride (6 L) in a 12 L flask equipped with a 
mechanical stirrer was added 440 mL fuming nitric acid over 
5 a period of 1 hour while maintaining a temperature of (- 
4 0°C) . The reaction was carefully allowed to warm to room 
temperature and stir an additional 4 h. The mixture was 
cooled to -30 °C, and isopropyl alcohol (6 L) added. The 
solution was stirred at -20 °C for 30 min during which time 

10 a white precipitate forms. The solution was allowed to 
stand for 15 min and the resulting precipitate collected by 
vacuum filtration to provide 4-Nitro-2-trichloroacetyl-l- 
methyl pyrrole. (0.8 kg, 54 % yield) TLC (7:2 benzene/ethyl 
acetate) Rf 0.7; *H NMR (DMSO-d 6 ) 5 8.55 (d # 1 H, J ~ 1.7 

15 Hz), 7.77 (d, 1 H, J = 1.7 Hz), 3.98 (s, 3 H) ; 13 C NMR 
(DMS0-d 6 ) 5 173.3, 134.7, 133.2, 121.1, 116.9, 95.0, 51.5; 
IR (KBr) 1694, 1516, 1423, 1314, 1183, 1113, 998, 750. 
FABMS m/e 269.936 (M + H 269.937 calc. for Ct^^C^C^) . 

20 2. Methyl 4-nitropyrrole-2 -carboxylate : 



To a solution of 4-Nitro-2-trichloroacetyl-l- 
methylpyrrole (800 g, 2.9 mol) in 2.5 L methanol in a 4 L 
Erlenmeyer flask equipped with a mechanical stirrer was 
25 added dropwise a solution of NaH (60% dispersion in oil) 
(10g, 0.25 mol) in 500 mL methanol. The reaction was 
stirred 2 h. at room temperature, and quenched by the 
addition of cone, sulfuric acid (25 mL) . The reaction was 
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then heated to reflux, allowed to slowly cool to room 
temperature as methyl 4-nitropyrrole-2-carboxylate 
crystallizes as white needles which were collected by 
vacuum filtration and dried in vacuo. (450 g, 47% yield) . 
5 TLC (ethyl acetate) Rf 0.8; X H NMR (DMSO-d s ) 5 8.22 (d, 1 H, 
J = 1.7 Hz), 7.22 (d, 1 H, J = 1.6 Hz), 3.88 (s, 3 H) , 3.75 
(s, 3 H); 13 C NMR (DMS0-d 6 ) 6 37.8, 52.2, 112.0, 123.0, 
129.9, 134.6, 160.3; IR(KBr) 3148, 1718, 1541, 1425, 1317, 
1226, 1195, 1116, 753. FABMS m/e 184.048 (M + H 184.048 
10 calc. for C 7 H 8 N 2 04) • 



Methyl 4-amino-l-methyl-pyrrole-2-carboxylate 
hydrochloride 




Methyl-4-nitropyrrole-2-carboxylate (450g, 2.8 mol) 
was dissolved in ethyl acetate (8 L) . A slurry of 40 g of 
10% Pd/C in 800 mL ethyl acetate was then added and the 
mixture stirred under a slight positive pressure of 
hydrogen (c.a. 1.1 atm) for 48 h. Pd/C was removed by 
filtration through Celite, washed 1 x 50 mL ethyl acetate, 
and the volume of the mixture reduced to c.a. 500 mL. 7 L 
of cold ethyl ether was added and HC1 gas gently bubbled 
through the mixture. The precipitated amine hydrochloride 
was then collected by vacuum filtration to yield (380 g, 
81.6 %) of Methyl 4-amino-l-methyl-pyrrole-2-carboxylate 
hydrochloride. as a white powder. TLC (ethyl acetate) 
Rf (amine) 0.6, Rf salt (0.0), 'h NMR (DMSO-d 6 ) 8 10.23 (br 
S, 3H), 7.24 (d, 1H J = 1.9), 6.79 (d, 1H, J = 2.0), 3.83 
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(S-, 3H) , 3.72 (s, 3H) 13 C NMR (DMSO-d 6 ) 8 160.8, 124.3, 

121.2, 113.4, 112.0, 51.8, 37.1; IR (KBr) 3095, 2693, 1709, 

1548, 1448, 1266, 1102, 802, 751. FABMS *m/e 154.075 
(154.074 calc. for C 7 H 10 N 2 O2) - 



10 amino- 1 -methyl -pyrrole-2-carboxylate hydrochloride (340 g, 
1.8 mol) was dissolved in 1 L of 10% aqueous sodium 
carbonate in a 3 L flask equipped with a mechanical 
stirrer, di - t-butyldicarbonate (400 g, 2.0 mmol) slurried 
in 500 mL of dioxane was added over a period of thirty min 

15 maintaining a temperature of 20 °C. The reaction was 
allowed to proceed for three h and was determined complete 
by TLC, cooled to 5°C for 2 h and the resulting white 
precipitate collected by vacuum filtration. The Boc-pyrrole 
ester contaminated with Boc-anhydride was dissolved in 700 

20 mL MeOH, 700 mL of 2M NaOH was added and the solution 
heated at 60 °C for 6 h. The reaction was cooled to room 
temperature, washed with ethyl ether (4 x 1000 mL) , the pH 
of the aqueous layer reduced to c.a. 3 with 10% (v/v) H 2 S0 4 , 
and extracted with ethyl acetate (4 x 2000 mL) . The 

25 combined ethyl acetate extracts were dried (sodium sulfate) 
and concentrated in vacuo to provide a tan foam. The foam 
was dissolved in 500 mL of DCM and 2 L petroleum ether 
added, the resulting slurry was concentrated in vacuo. The 



5 



4- [ (tert-Butoxycarbonyl) amino] -l-methylpyrrole-2- 
carboxylic acid 
BOCHN 




The hydrochloride salt of the pyrrole amine Methyl 4- 
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reaction was redissolved and concentrated three 
additional times to provide (320 g, 78 % yield) of 4- 
[ ( tert-Butoxycarbonyl) amino] -l-methylpyrrole-2-carboxylic 
acid as a fine white powder. TLC (7:2 benzene/ethyl 
5 acetate v/v) Rf (ester) 0.8, Rf (acid) 0.1. (ethyl 
acetate), Rf (acid) 0.6, X H NMR (DMS0-d 6 ) 5 12.10 (s, 1H) , 
9.05 ( S/ 1H), 7.02 (s, 1H) , 6.55 (s, 1H) , 3.75 (s, 3H) , 
1.41 (s, 9H) 13 C NMR (DMSO-d 6 ) 5 162.4, 153.2, 123.3, 
120.1, 119.2, 107.9, 78.9, 36.6, 28.7.; IR(KBr) 3350, 
10 2978, 1700, 1670, 1586, 1458, 1368, 1247, 1112, 887, 779. 
FABMS m/e 241.119 (M +H 241.119 calc. for CuH^CX,) . 



Boc-Py-acid, 4- [ ( tert-Butoxycarbonyl ) amino] -1- 
methylpyrrole-2-carboxylic acid (31 g, 129 mmol) was 
dissolved in 500 mL DMF, HOBt (17.4 g, 12 9 mmol) was 

25 added followed by DCC (34 g, 129 mmol) . The reaction was 
stirred for 24 h and then filtered dropwise into a well 
stirred solution of 5 L of ice water. The precipitate was 
allowed to sit for 15 min at 0 °C and then collected by 
filtration. The wet cake was dissolved in 500 mL DCM, and 

30 the organic layer added slowly to a stirred solution of 
cold petroleum ether (4 °C) . The mixture was allowed to 
stand at -20 °C for 4 h and then collected by vacuum 
filtration and dried in vacuo to provide (39 g, 85% 
yield) of 1,2,3- 



5. 



1, 2, 3-Benzotriazol-l-yl 



4- [ (tert- 



butoxycarbonyl) - amino] -l-methylpyrrole-2- 
carboxylate 
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Benzotriazol-l-yl 4- [ ( tert-butoxycarbonyl) -amino] -1- 

methylpyrrole-2-carboxylate as a finely divided white 
powder. TLC (7:2 benzene/ ethyl acetate v/v) Rf" 0.6, J H NMR 

(DMSO-d 6 ) 5 9.43 (s, 1H), 8.12 (d, 1H, J = 8.4 Hz), 7.80 
5 (d, 1H, J = 8.2Hz), 7.64 (t, 1H, J = 7.0 Hz), 7.51 (m, 
2H) , 7.18 (s, 1H) , 3.83 (s, 3H) , 1.45 (s, 9H) , 13 C NMR 

<DMSO-d 6 ) 5 156.5, 153.3, 143.2, 129.6, 129.2, 125.7, 125.2, 
124.6, 120.3, 112.8, 110.3, 109.8, 79.5, 36.8, 28.6.; IR 

(KBr) 3246, 3095, 2979, 1764, 1711, 1588, 1389, 1365, 1274, 
10 1227, 1160, 1101, 999, 824, 748.; FABMS m/e 358.152 (M + H 
358.151 calc. for C 17 H2oN 5 0 4 ) . 

6. Ethyl l-methylimidazole-2-carboxylate. 




o 

15 N- methyl imidazole (320 g, 3.9 mol) was combined with 2 

L acetonitrile and 1 L triethylamine in a 12 L flask 
equipped with a mechanical stirrer and the solution cooled 
to -20 °C. Ethyl chloroformate (1000 g, 9.2 mol) was added 
with stirring, keeping the temperature between -20 °C and - 

20 25 °C. The reaction was allowed to slowly warm to room 
temperature and stir for 3 6 h. Precipitated triethylamine 
hydrochloride was removed by filtration and the solution 
concentrated in vacuo, at 65 °C. The resulting oil was 
purified by distillation under reduced pressure (2 torr, 

25 1 02 °C) to provide Ethyl l-methylimidazole-2-carboxylate 
as a white solid (360 g, 82 % yield). TLC (7:2 benzene/ 
ethyl acetate v/v) Rf 0.2; X H NMR (DMSO-d 6 ) 6 7.44 (d, 1 H, 
J = 2.8 Hz), 7.04 (d, 1 H, J = 2.8 Hz), 4.26 (q, 2 H, J" = 
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3.5 Hz), 3.91 (s, 3 H), 1.26 (t, 3 H, J = 3.5 Hz); 13 C NMR 
(DMSO-d 6 ) 6 159.3, 129.1, 127.7, 61.0, 36.0, 14.5; IR(KBr) 
3403, 3111, 2983, 1713, 1480, 1422, 1262, 1134, 1052, 922, 
782, 666; FABMS m/e 155.083 (M + H 155.083 calc. for 
5 C 7 H 11 N 2 0 2 ) . 



10 



15 



20 



25 



7. Ethyl l-methyl-4-nitroimidazole-2-carboxylate: 
0 2 N 



N 




6 



Ethyl l-methylimidazole-2-carboxylate was carefully 
dissolved in 1000 mL of concentrated sulfuric acid cooled 
to 0°C. 90% nitric acid (1 L) was slowly added maintaining 
a temperature of 0 °C. The reaction was then refluxed with 
an efficient condenser (-20 °C) in a well ventilated hood 
for 50 min. The reaction was cooled with an ice bath, and 
quenched by pouring onto 10 L ice. The resulting blue 
solution was then extracted with 20 L DCM, the combined 
extracts dried (sodium sulfate) and concentrated in vacuo 
to yield a tan solid which was recrystallized from 22 L of 
21:1 carbon tetrachloride/ethanol . The resulting white 
crystals are collected by vacuum filtration to provide pure 
Ethyl l-methyl-4-nitroimidazole-2-carboxylate. (103 g, 22% 
yield). TLC (7:2 benzene/ ethyl acetate v/v) Rf 0.5, X H NMR 
(DMSO-d 6 ) 5 8.61 (s, 1 H), 4.33 (q, 2 H, J = 6.4 Hz), 3.97 
(s, 3 H) , 1.29 (t, 3 H, J = 6.0 Hz), 13 C NMR (DMSO-d 6 ) 6 
158.2, 145.4, 135.3, 127.4, 62.2, 37.3, 14.5; IR(KBr) 3139, 
1719, 1541, 1498, 1381, 1310, 1260, 1122, 995, 860, 656.; 
FABMS m/e 200.066 (M +H 200.067 calc. for C 7 H 10 N 3 O 4 ) . 
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Ethyl 4-amino-l-methylimidazole-2-carboxylate 
hydrochloride 



15 




o 



5 The nitro imidazole ethyl ester Ethyl 1 -methyl -4- 

nitroimidazole-2-carboxylate (103g, 520 mmol) was dissolved 
in 5 L off 1:1 ethanol/ethyl acetate, 20g 10% Pd/C slurried 
in 500 mL ethyl acetate was added and the mixture stirred 
under a slight positive pressure of hydrogen (c.a. l.i a tm) 
10 for 48 h. The reaction mixture was filtered, concentrated 
in vacuo to a volume of 500 mL and 5 L of cold anhydrous 
ethyl ether added. Addition of HC1 gas provided a white 
precipitate. The solution was cooled at -20 °C for 4 h and 
the precipitate collected by vacuum filtration and dried in 
vacuo to provide (75 g, 78% yield) of ethyl 4 -amino- 1- 
methylimidazole-2-carboxylate hydrochloride as a fine white 
powder. TLC (7:2 benzene: ethyl acetate) R f (amine) 0.3, R f 
(salt) 0.0. X H NMR (DMS0-d 6 ) 8 10.11 (br s, 3H) , 7.43 (s, 
1H), 4.28 (q, 2H, J= 7.1Hz), 3.92 (s, 1H) , 1.28 (t, 3H, J 
20 = 7.1 Hz) 13 C NMR (DMS0-d 6 ) 8 157.6, 132.6, 117.4, 117.3, 
61.8, 36.6, 14.5; IR(KBr) 3138, 2883, 1707, 1655, 1492, 
1420, 1314, 1255, 1152, 1057, 837, 776.; FABMS m/e 169.085 
(169.084 calc. for C7HnN 3 0 2 ) . 
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9. 4- [ (tert-butoxycarbonyl) amino] -1-methylimidazole- 
2-carboxylic acid 



10 



15 



20 



25 




N 



O 



The imidazole amine ethyl 4-amino-l-methylimidazole- 
2-carboxylate hydrochloride (75 g, 395 mmol) was dissolved 
in 200 mL DMF. DIEA (45 mL, 491 mmol) was added followed by 
di-t-butyldicarbonate (99 g, 491 mmol). The mixture was 
shaken at 60 °C for 18 h, allowed to assume room 
temperature, and partitioned between 500 mL brine, 500 mL 
ethyl ether. The ether layer was extracted (2 x 200 mL 
each) 10% citric acid, brine, satd. sodium bicarbonate, 
brine, dried over sodium sulfate and concentrated in vacuo 
to yield the Boc-ester contaminated with 20% Boc-anhydride 
as indicated by *H NMR. The Boc-ester, used without further 
purification, was dissolved in 200 mL 1M NaOH. The reaction 
mixture was allowed to stand for 3 h at 60 °c with 
occasional agitation. The reaction mixture was' cooled to 0 
°C, and carefully neutralized with 1 M HC1 to pH 2, at 
which time a white gel forms. The gel was collected by 
vacuum filtration, frozen before drying, and remaining 
water lyophilized to yield 4- [ (tert-butoxycarbonyl) amino] - 
l-methylimidazole-2-carboxylic acid as a white powder. (51 
g, 54% yield). X H NMR (DMSO-d 6 ) 6 9.47 (s, 1H) , 7.13 (s, 
1H), 3.85 (s, 3H), 1.41 (s, 9H) . 13 C NMR (DMS0-d 6 ) 5 160.9, 
152.9, 137.5, 134.5, 112.4, 79.5, 35.7, 28.6; IR(KBr) 3448, 
2982, 1734, 1654, 1638, 1578, 1357, 1321, 1249, 1163, 799.; 
FABMS m/e 241.105 (241.106 calc. for C 10 H 15 N 3 O 4 ) . 
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10. y- [ (tert-butoxycarbonyl) amino] -butyric acid - (4- 
carboxamido-1 -methyl -imidazole) -2 -carboxylic acid 

o 

To a solution of Boc-y-aminobutyric acid (10 g, 49 
5 mmol) in 40 mL DMF was added 1.2 eq HOBt (7.9 g # 59 mmol) 
followed by 1.2 eq DCC (11.9 g, 59 mmol). The solution was 
stirred for 24 h, and the DCU removed by filtration. 
Separately, to a solution of ethyl 4-nitro-l- 
methylimidazole-2-carboxylate (9.8 g, 49 mmol) in 20 mL DMF 

10 was added Pd/C catalyst (10%, 1 g) , and the mixture was 
hydrogenated in a Parr bom apparatus (500 psi H 2 ) for 2 h. 
The catalyst was removed by filtration through celite and 
filtrate immediately added to the -OBt ester solution. An 
excess of DIEA (15 mL) was then added and the reaction 

15 stirred at 37 °C for 48 h. The reaction mixture was then 
added dropwise to a stirred solution of ice water and the 
resulting precipitate collected by vacuum filtration to 
provide crude ethyl y- [ [ (tert- butoxy) carbonyl] amino] - 
butyric acid - (4 -carboxamido-1 -methyl -pyrrole) -2- 

20 carboxylate (5 g, 14.1 mmol). To the crude ester dissolved 
in 50 mL methanol was added 50 mL 1M KOH and the resulting 
mixture allowed to stir for 6 h at 3 7°C. Excess methanol 
was removed in vacuo and the resulting solution acidified 
by the addition of 1 M HC1 . The resulting precipitate was 

25 collected by vacuum filtration and dried in vacuo to yield 
y- [ (tert-butoxycarbonyl) amino] -butyric acid - (4-carboxamido- 
1 -methyl -imidazole) -2-carboxylic acid as a brown powder. 
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(4.4g, 89% yield). »H NMR (DMSO-d 6 ) 5 10.50 (s, 1 H) , 7.45 
(8. 1 H), 6.82 (t, 1 H. J = 3.6 Hz), 3.86 (s, 3 H) , 2.86 
(q, 2 H, J = 4.6 Hz), 2.22 (t, 2 H, J = 7 .4 Hz), 1.57 
(quintet, 2 H, J = 5.9 Hz), 1.29 (s, 9 H) ; IR 3416, 2950, 
5 2841, 1650,1538 1449, 1392, 1250, 1165, 1108; FABMS m/e 
326.160 (326.159 calc. for C 14 H 22 N 4 O s ) . 

11. 4- [ (tert-butoxycarbonyl) amino] -l-methylpyrrole-2- 
(4-carboxamido-l-methyl imidazole) -2-carboxylic 



10 acid 



15 



was 



H 

1 ° W oh 

1 O 

4- [ (tert-butoxycarbonyl) amino] -l-methylpyrrole-2- (4- 
carboxamido-1 -methyl imidazole) -2-carboxylic acid 
prepared as described below for y- [ (tert-butoxycarbonyl) - 
amino] -butyric acid - (4 -carboxamido- 1 -methyl -imidazole) -2- 
carboxylic acid substituting Boc-Pyrrole acid for Boc-y- 
aminobutyric acid. (4.1 g, 91% yield). *H NMR (DMSO-d 6 ) 5 
10.58 (s, 1 H), 9.08 (s, 1 H) , 7.57 (s, 1 H) . 6.97 (s, 1 
H), 6.89 (s, 1 H), 3.89 (s, 3 H) , 3.75 (s, 3 H) , 1.35 (s, 9 
20 H); 13 C NMR (DMSO-d 6 ) 8 160.36, 159.1, 153.4, 137.9, 132.3, 
122.8, 122.3, 118.5, 115.5, 105.5, 105.4, 78.8, 28.7, 24.9; 
IR 3346, 2929, 1685, 1618, 1529, 1342, 1274, 1179, 997, 
761. FABMS m/e 364.161 (364.162 calc. for C 16 H 22 N 5 0 5 ) . 
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C. 



Solution Phase Synthesis of Polyamides Using Boc- 
Protected Pyrrole and Imidazole Building Blocks. 
1 . Aminohexa- (tf-methylpyrrolecarboxamide j 
ditrif luoroacetate . 



BocHN 




N 

I 6 



1 ) HBTU/DIEA 
,OH 2) H2NPyPyPy-Dp, DIEA 



^•CF^COOH 

V 
I 6 





Fx 



N 

I O 



I • CFjCOOH 



10 



15 



20 



25 



To a solution of N- ( tert-butoxycarbonyl) -tris (N- 
methylpyrrolecarboxamide) (20 mg, 41 /zmol) in DMF (100/il) 
was added HBTU (26 mg, 69 /zmol) followed by DIEA (50 /il, 
288 /zmol) . The reaction was allowed to stand for 5 
minutes, agitated, and allowed to stand for an additional 
five minutes. Aminotris- (N-methylpyrrolecarboxamide) (24 
mg, 41 /imol) was then added followed by DIEA (50 /il, 288 
/zmol) and the reaction agitated for 2 hours. The reaction 
mixture was concentrated in vacuo and TFA (10 ml) added. 
After 2 minutes the TFA was removed in vacuo. Purification 
of the resulting brown oil by reversed phase HPLC afforded 
the diamine aminohexa- (iV-methylpyrrolecarboxamide) 

ditrif luoroacetate as a white powder. Yield: 26 mg (58%); 
X H NMR (DMSOds) 5 10.06 (s, 1 H) , 9.95 (m, 2 H) , 9.91 (s, 1 
H), 9.84 (s, 1 H), 9.44 (br s, 1 H) , 8.16 (t, 1 H, J = 4.0 
Hz), 7.22 (m, 4 H), 7.16 (d, 1 H, J = 1.7 Hz), 7.10 (s, 1 
H, J = 1.7 Hz), 7.07 (m, 3 H) , 6.98 (s, 1 H, J = 1.7 Hz), 
6.93 (s, 1 H, J = 1.8 Hz), 3.88 (m, 6 H) , 3.84 (m, 12 H) , 
3.79 (m, 6 H), 3.21 (m, 2 H) , 3.04 (m, 2 H) , 2.77 (d, 6 H, 
J = 4.8 Hz), 1.80 (m, 2 H) ; FABMS m/e 835.412 (M + H, 
835.416 calc. for C 4 iH 51 N 14 0 6 ) . 
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2 . ImPyPyPyPyPypy-Dp 




N-methyl-Imidazole-2-carboxylic acid (100 mg, 741 
/imol) and HOBt (72 mg, 500 /imol) were suspended in 500 /il 
5 DMF. Upon addition of DCC (100 mg, 500 /imol) the reaction 
mixture became a homogeneous solution. The activation was 
allowed to stand for 12 hours, precipitated 
dicyclohexylurea removed by filtration and Aminohexa- (N- 
methylpyrrolecarboxamide) ditrif luoroacetate (10 mg, 9.4 
10 /imol) added followed by DIEA (100 /il, 576 /imol), and the 
reaction allowed to stand for 2 hours. Reversed phase HPLC 
purification of the reaction mixture afforded 
ImPyPyPyPyPyPy-Dp as a white powder. Yield: 6.3 mg (62%); 
HPLC, r.t. 27.4 min; UV^ (e) , 246 (34,100), 304 (56,600) 
•15 nm; X H NMR (DMSO-d*) 5 10.46 (s, 1 H) , 9.55 (s, 1 H) , 9.94 
(m, 3 H) , 9.90 (s, 1 H) , 9.20 (br s, 1 H) , 8,14 (t, 1 H, J 
= 7.2 Hz), 7.38 (s, 1 H) , 7.28 (d, 1 H, J = 1.4 Hz), 7.26 
(d, 1 H, J = 1.4 Hz), 7.23 (m, 4 H) , 7.08 (m, 5 H) , 7.04 
(s, 1 H, J = 1.2 Hz), 6.93 (d, 1 H, J = 1.6 Hz), 3.98 (s, 3 
20 H), 3.84 (m, 15 H) , 3.83 (s, 1 H) , 3.30 (q, 2 H, J = 7.4 
Hz), 3.21 (t, 2 H, J = 7.1 Hz), 2.77 (d, 6H, J = 4.1 Hz), 
1.74 (m, 2 H) ; MALDI-TOF MS 944.21 (M + H 944.04 calc); 
FABMS m/e 965.430 (M + Na , 965.426 calc. for C 46 H 5 4N 16 0 7 Na) . 
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D.. Solid Phase Syntheses 

1. Activation of Imidazole- 2 -carboxylic acid, Boc-y- 
aminobutyric acid, Boc-glycine, and Boc-P-alanine 

The appropriate amino acid or acid (2 -mmol) was 
5 dissolved in 2 mL DMF. HBTU (72 0 mg, 1.9 mmol) was added 
followed by DIEA (1 mL) and the solution lightly shaken for 
at least 5 min. 



2. Activation of Boc- Imidazole acid 

Boc imidazole acid (257 mg, 1 mmol) and HOBt (135 mg, 
1 mmol) were dissolved in 2 mL DMF, DCC (202 mg, 1 mmol) is 
then added and the solution allowed to stand for at least 5 
min. 



!5 3. Activation of Boc-y-Imidazole acid and Boc- 

Pyrrole- Imidazole acid 

The appropriate dimer (1 mmol) and HBTU (378 mg, 1 
mmol) are combined in 2 mL DMF. DIEA (1 mL) is then added 
and the reaction mixture allowed to stand for 5 min. 

20 

4. Activation of Boc-Pyrrole acid, (for coupling to 
Imidazole amine) 

Boc-Pyrrole acid (514 mg, 2 mmol) was dissolved in 2 
mL dichloromethane, DCC (420 mg, 2 mmol) added, and the 
25 solution allowed to stand for 10 min, DMAP (101 mg, 1 mmol) 
was added and the solution allowed to stand for 1 min. 



5, Acetylation Mix. 

2 mL DMF, DIEA (710 /iL, 4.0 mmol), and acetic 
30 anhydride (380 /iL, 4.0 mmol) were combined immediately 
before use. 
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6. Manual Synthesis Protocol 

Boc-p-alanine-Pam-Resin (1.25 g, 0.25 mrtiol) is placed 
in a 20 mL glass reaction vessel, shaken in DMF for 5 min 
5 and the reaction vessel drained. The resin was washed with 
DCM (2 x 30 s.) and the Boc group removed with 80% 
TFA/DCM/0 . 5M PhSH, 1 x 30s., 1 x 20 min The resin was 
washed with DCM (2 x 30 s.) followed by DMF (1 x 30 s.) A 
resin sample (5 - 10 mg) was taken for analysis. The vessel 

10 was drained completely and activated monomer added, 
followed by DIEA if necessary. The reaction vessel was 
shaken vigorously to make a slurry. The coupling was 
allowed to proceed for 45 min, and a resin sample taken. 
The reaction vessel was then washed with DCM, followed by 

15 DMF. 

7. Machine-Assisted Protocols 

Machine-assisted synthesis was performed on a ABI 430A 
synthesizer on a 0.18 mmol scale (900 mg resin; 0.2 

20 mmol/gram) . Each cycle of amino acid addition involved: 
deprotection with approximately 80% TFA/DCM/0. 4M PhSH for 3 
minutes, draining the reaction vessel, and then 
deprotection for 17 minutes; 2 dichloromethane flow washes; 
an NMP flow wash; draining the reaction vessel; coupling 

25 for 1 hour with in situ neutralization, addition of 
dimethyl sulfoxide (DMSO)/NMP, coupling for 30 minutes, 
addition of DIEA, coupling for 30 minutes; draining the 
reaction vessel; washing with DCM, taking a resin sample 
for evaluation of the progress of the synthesis by HPLC 

30 analysis; capping with acetic anhydride/DIEA in DCM for 6 
minutes; and washing with DCM. A double couple cycle is 
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employed when coupling aliphatic amino acids to imidazole, 
all other couplings are performed with single couple 
cycles. 

The ABI 430A synthesizer was left in the standard 
5 hardware configuration for NMP-HOBt protocols. Reagent 
positions 1 and 7 were DIEA, reagent position 2 was 
TFA/0.5M thiophenol, reagent position 3 was 70% 
ethanolamine/methanol , reagent position 4 was acetic 
anhydride, reagent position 5 was DMSO/NMP, reagent 

10 position 6 was methanol, and reagent position 8 was DMF. 
New activator functions were written, one for direct 
transfer of the cartridge contents to the concentrator 
(switch list 21, 25, 26, 35, 37, 44), and a second for 
transfer of reagent position 8 directly to the cartridge 

15 (switch list 37, 39, 45, 46) . 

Boc-Py-OBt ester (357 mg, 1 mmol) was dissolved in 2 ml 
DMF and filtered into a synthesis cartridge. Boc-Im acid 
monomer was activated (DCC/HOBt) , filtered, and placed in a 
synthesis cartridge. Imidazole-2-carboxylic acid was added 

20 manually. At the initiation of the coupling cycle the 
synthesis was interrupted, the reaction vessel vented and 
the activated monomer added directly to the reaction vessel 
through the resin sampling loop via syringe. When manual 
addition was necessary an empty synthesis cartridge was 

25 used. Aliphatic amino acids (2 mmol) and HBTU (1.9 mmol) 
were placed in a synthesis cartridge. 3 ml of DMF was 
added using a calibrated delivery loop from reagent bottle 
8, followed by calibrated delivery of 1 ml DIEA from 
reagent bottle 7, and a 3 minute mixing of the cartridge. 

30 The activator cycle was written to transfer activated 

monomer directly from the cartridge to the concentrator 
vessel, bypassing the activator vessel. After transfer, 1 
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ml of DIEA was measured into the cartridge using a 
calibrated delivery loop, and the DIEA solution combined 
with the activated monomer solution in the ' concentrator 
vessel. The activated ester in 2:1 DMF/DIEA was then 
i transferred to the reaction vessel. All lines were emptied 
with argon before and after solution transfers. 

8. Stepwise HPLC analysis 

A resin sample (c.a. 4 mg) was placed in a 4 mL glass 
test tube, 200/zL of N, N-dimethylaminopropylamine was added 
and the mixture heated at 100 °C for 5 min. The cleavage 
mixture was filtered and a 25/zL sample analyzed by 
analytical HPLC at 254 nm. 



15 9 ' Typical Manual Synthesis Protocol: PyPyPy-y- 

ImlmPy-p-Dp 

0 



20 




Boc-p-Pam-resin (1.25 g, 0.25 mmol amine) was shaken in 
DMF for 30 min and drained. The N-Boc group removed by 
washing with DCM for 2 x 30 s, followed by a 1 min shake in 
80% TFA/DCM/0 . 5M PhSH, draining the reaction vessel and a 
brief 80% TFA/DCM/ 0.5 M PhSH wash, and 20 min shaking in 
80% TFA/DCM/0. 5M PhSH solution. The resin was washed 1 min 
with DCM and 30 s with DMF. A resin sample (8-10 mg) was 
25 taken for analysis. The resin was drained completely and 
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Boc-pyrrole-OBt monomer (357 mg, 1 mmol) dissolved in 2 ml 
DMF added followed by DIEA (1 ml) and the resin shaken 
vigorously to make a slurry. The coupling was allowed to 
proceed for 45 min. A resin sample (8-10 mg) was taken 
5 after 4 0 min to check reaction progress. The reaction 
vessel was washed with DMF for 30 s and dichloromethane for 
1 min to complete a single reaction cycle. Six additional 
cycles were performed adding, Boclm-OH (DCC/HOBt) , Boclm-OH 
(DCC/HOBt) , Boc-y-aminobutyric acid (HBTU/DIEA) and allowed 

10 to couple for 2 hours, BocPy-OBt, BocPy-OBt, and pyrrole-2- 
carboxylic acid (HBTU/DIEA) . The resin was washed with DMF, 
DCM, MeOH, and ethyl ether and then dried in vacuo. 
PyPyPy-Y-ImlmPy-p-Pam-Resin (180 mg, 29 /imol) 12 was weighed 
into a glass scintillation vial, 1.5 ml of N,N- 

15 dimethylaminopropylamine added, and the mixture heated at 
55 °C for 18 hours. The resin was removed by filtration 
through a disposable polypropylene filter and washed with 5 
ml of water, the amine solution and the water washes 
combined, and the solution loaded on a C 18 preparatory HPLC 

20 column, the column allowed to wash for 4 min in 0.1% TFA at 
8 ml/min, the polyamide was then eluted in 100 min. as a- 
well defined peak with a gradient of 0.25% acetonitrile per 
min. The polyamide was collected in four separate 8 ml 
fractions, the purity of the individual fractions verified 

25 by HPLC and *H NMR, to provide purified PyPyPy-y-ImlmPy-p- 
Dp(11.2 mg, 39% recovery), UV X^*, 246 (31, 100), 312 
(51,200) HPLC, r.t. 23.6, X H NMR (DMSO-d 6 ) 8 10.30 (s, 1 H) , 
10.26 (s, 1 H), 9.88 (s, 1 H) , 9.80 (s , 1 H) , 9.30 (s, 1 
H) , 9.2 (br s, 1 H) , 8.01 (m, 3 H) , 7.82 (br s 1 H) , 7.54 

30 (s, 1 H), 7.52 (s, 1 H), 7.20 (d, 1 H, J = 1.3 Hz), 7.18 
(d, 1 H, J a 1.2 Hz), 7.15 (d, 1 H, J = 1.3 Hz), 7.01 (d, 1 
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H v J = 1.4 Hz), 6.96 (d, 1 H f J » 1.4 Hz), 6.92 (d, 1 H, J 
= 1.8 Hz), 6.89 (m, 2 H) , 6.03 (t, 1 H, J = 2.4 Hz), 3.97 
(s, 3 H) , 3.96 (s, 3 H) , 3.85 (s, 3 H) , 3.82 ( s , 3 H) , 3.78 
(m, 6 H) , 3.37 (m, 2 H) , 3.20 (q, 2 H, J = 5.7 Hz), 3.08 
5 (q, 2 H J = 6.6 Hz), 2.94 (q, 2 H J = 5.3 Hz), 2.71 (d, 6 H 
J = 5.8 Hz), 2.32 (m, 4 H) , 1.83 (m, 4 H) ; MALDI -TOF-MS , 
978.7 (979.1 calc. for M+H) . 



9 . ImlmPy - y - Py Py Py - P - Dp 



10 




Polyamide was prepared by machine assisted solid phase 
synthesis protocols and 900 mg resin cleaved and purifed to 
provide ImlmPy-Y-PyPyPy-p-Dp as a white powder. (69 mg, 48% 
recovery), UV 246 (43,300), 308 (54,200) HPLC, r.t. 

15 23.9, *H NMR (DMSO-d 6 ) 5 10.31 (s, 1 H) , 9.91 (s, 1 H) , 9.90 
(s, 1 H) , 9.85 (s, 1 H) , 9.75 (s, 1 H) , 9.34 (br s, 1 H) , 
8.03 (m, 3 H) , 7.56 (s, 1 H) , 7.46 (s, 1 H) , 7.21 (m, 2 H) , 
7.15 (m, 2 H) , 7.07 (d, 1 H J = 1.2 Hz), 7.03 (d, 1 H, J = 
1.3 Hz), 6.98 (d, 1 H, J = 1.2 Hz), 6.87 (m, 2 H) , 4.02 (m, 

20 6 H) , 3.96 (m, 6 H) , 3.87 (m, 6 H) , 3 .75 (q, 2 H, J = 4.9 
Hz), 3.36 (q, 2 H, J = 4.0 Hz), 3.20 (q, 2 H, J = 4.7 Hz), 
3.01 (q, 2 H J = 5.1 Hz), 2.71 (d, 6H, J = 4.8 Hz), 2.42 
(m, 4 H) , 1.80 (m, 4 H) MALDI -TOF-MS 978.8, (979.1 calc. 
for M+H) 
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10 



15 



10 . AcImlmPy-y-PyPyPy-p-Dp 
O 

A 




o o 

Polyamide was prepared by manual solid phase protocols 
and isolated as a white powder. (8 mg, 28% recovery) , UV 
Kax, 246 (43,400), 312 (50,200) HPLC, r.t. 24.8, X H NMR 
(DMSO-d 6 ) 5 10.35 (s, 1 H), 10.30 (s, 1 H) , 9.97 (s, 1 H) , 
9.90 (s, 1 H), 9.82 (s, 1 H) , 9.30 (s, 1 H) , 9.2 (br s 
1H), 8.02 (m, 3 H) , 7.52 (s, 1 H) . 7.48 (s, 1 H) , 7.21 (tn, 
2H), 7.16 (d, 1 H, J = 1.1 Hz), 7.11 (d, 1 H, J = 1.2 Hz), 
7.04 (d, 1 H, J = 1.1 Hz), 6.97 (d, 1 H, J = 1.3 Hz), 6.92 
(d, 1 H, J = 1.4 Hz), 6.87 (d, 1 H, J = 1.2 Hz), 3.99 (s, 3 
H), 3.97 (s, 3 H), 3.83 (s, 3 H) , 3.82 (s, 3 H) , 3.80 (s, 3 
H), 3.79 (s, 3 H) , 3.47 (q, 2 H, J = 4.7 Hz), 3.30 (q, 2 H, 
J = 4.6 Hz), 3.20 (q, 2 H, J = 5.0 Hz), 3.05 (q, 2 H, J = 
5.1 Hz), 2.75 (d, 6 H, J = 4.1 Hz), 2.27 (m, 4 H) , 2.03 (s, 
3 H), 1.74 (m, 4 H) MALDI -TOF-MS , 1036.4 (1036.1 calc. for 
M+H) . 
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11 ♦ AcPyPyPy-y- imlmPy- p -Dp 

£ t 



protocols as a white powder. (14 



5 recovery), UV X^, 246 (44,400), 312 (52,300) HPLC, r.t. 
23.8, X H NMR (DMSO-d 6 ) 10.32 (s, 1 H) , 10.28 (s, 1 H) , 9.89 
(m, 2 H) , 9.82 (s, 1 H) , 9.18 (s, 1 H) , 9.10 (br s, 1 H) , 
8.03 (m, 3 H), 7.55 (s, 1 H) , 7.52 (s, 1 H) , 7.21 (d, 1 H, 
J = 1.1 Hz), 7.18 (d, 1 H, J = 7.16), 7.15 (d, 1 H, J = 1.0 

10 HZ), 7.12 (d, 1H, J = 1.0 Hz), 7.02 (d, 1 H, J = 1.0 Hz), 
6.92 (d, 1 H, J = 1.1 Hz), 6.87 (d, 1H, J = 1.1 Hz), 6.84 
(d, 1H, J = 1.0 Hz), 3.97 (s, 3 H) , 3.93 (s, 3 H) , 3.87 (s, 
3 H), 3.80 (s, 3 H), 3.78 (m, 6 H) , 3.35 (q, 2 H, J = 5.6 
Hz), 3.19 (q, 2 H, J = 5.3 Hz), 3.08 (q, 2 H, J = 5.7 Hz), 

15 2.87 (q, 2 H, J = 5.8 Hz), 2.71 (d, 6 H, J = 4.0 Hz), 2.33 
(m, 4 H), 1.99 (s, 3 H) , 1.74 (m, 4 H) . MALDI -TOF-MS , 
1036.2 (1036.1 calc for M+H) . 
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12 . ImPyPy-y-PyPyPy-p-Dp 




ImPyPy-y-PyPyPy-fJ-Pam-Resin was prepared by machine- 
assisted synthesis protocols. A sample of resin (l g, 0.17 
5 mmol was placed in a 20 mL glass scintillation vial, 4 mL 
of dimethyl aminopropyl amine added, and the solution heated 
at 55 °C for 18 h. Resin substitution is calculated as 
L new (mmol/g) = L 0 i d /(1 + L c i d (W new - W old ) x 1CT 3 ) ; L is the 
loading, and W is the molecular weight of the polyamide 

10 attached to the resin. (Barlos, et al . Jnt. J. Peptide 
Protein Res. 1991, 37, 513.) Resin is removed by filtration 
through a disposable propylene filter and 16 mL of water 
added. The polyamide/amine mixture was purified directly 
by preparatory HPLC and the appropriate fractions 

15 lyophylized to yield a white powder. (103 mg, 61 % 
recovery) HPLC r.t. 24.1, UV Kaxi^O) (e) , 234 nm (39,300), 
304 nm (52,000); X H NMR (DMSO-d 6 ) ; 10.47 (s, 1 H) , 9.91 (s, 
1 H), 9.89 (s, 1 H), 9.87 (s, 1 H) , 9.84 (s, 1 H) , 9.2 (br 
s, 1 H), 8.08 (m, 3 H) , 7.38 (s, 1 H) , 7.26 (d, 1 H, J = 

20 1.0 Hz), 7.20 (d, 1 H, J = 1.0 Hz), 7.14 (m, 4 H) , 7.04 (d, 
1 H, J = 1.1 Hz), 7.02 (d, 1 H, J = 1.1 Hz), 6.89 (d, 1 
H, J = 1.0 Hz), 6.85 (m, 2 H) , 3.97 (s, 3 H) , 3.82 (m, 6 
H) , 3.81 (s, 3 H) , 3.77 (m, 6 H) , 3.34 (m, 2 H, J = 3.9 
Hz), 3.18 (m, 2 H, J = 5.5 Hz), 3.06 (m, 2 H, J = 5.7 Hz), 
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2.95 (m, 2 H, J = 4.9 Hz), 2.71 (d, 6 H, J = 4.6 Hz), 2.30 

(m, 6 H) , 1.75 (m, 4 H) ; MALDI-TOF MS 978.0 (978.1 calc. 
for M+H) . 



5 13. ImPyPy-y-PyPypy-G-Dp 




ImPyPy-y-PyPyPy-G-Dp was prepared as described for 
ImPyPy-y-PyPyPy-p-Dp. (12 mg, 40% recovery). HPLC, r.t. 
26.9, UV (H 2 0) , 246 (41,100), 306 (51,300) X H NMR (DMSO- 

10 d 6 ) 6 10.50 (s, 1 H) , 9.95 (s, 1 H) , 9.93 (s, 1 H) , 9.92 (s, 
1 H) , 9.86 (s, 1 H) , 9.2 (br S, 1H) , 8.29 (t, 1 H, J = 4.4 
Hz), 8.07 (t, 1 H, J = 5.2 Hz), 8.03 (t, 1 H, J = 5.4 
Hz), 7.39 (s, 1 H) , 7.27, (d, 1 H, J = 1.6 Hz), 7.22 (m, 2 
H) , 7.16 (m, 2 H) , 7.04 (m, 2 H) , 6.92 (d, 1 H, J = 1.6 

15 Hz), 6.89 (d, 1 H, J = 1.7 Hz), 6.86 (d, 1 H, J = 1.6 Hz), 
3.97 (s, 3 H) , 3.82 (m, 6 H) , 3.81 (s, 3 H) , 3.78 (m, 6 H) , 
3.70 (d, 2 H, J = 5.7 Hz), 3.20 (q, 2 H, J = 5.7), 3.11 
(q, 2 H, J = 4.2 Hz), 3.00 (q, 2 H, J = 4.4 Hz), 2.76 (d, 
6 H, J = 4.7 Hz), 2.24 (t, 2 H, J = 4.8 Hz), 1.77 (m, 4 

20 H) ; MALDI-TOF -MS, 964.3 (964.1 calc. for M+H) . 
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14. Ac ImPy Py -y - Py Py Py - G - Dp 

fl i 



/ 



i 6 



AcImPyPy-y-PyPyPy-G-Dp was prepared as described for 
ImPyPy-y-PyPyPy-p-Dp. (13.1 mg, 30% yield) HPLC, r.t. 24.0, 
5 UV ^ (H 2 0) , 246 (35,900), 312 (48,800) X H NMR (DMSO-d 6 ) 8 
10.23 (s, 1 H), 9.98 (s, 1 H) , 9.32 (s, 1 H) , 9.90 (m, 2 
H), 9.84 (s, 1 H), 9.2 (br s, 1 H) , 8.27 (t, 1 H, J = 5.0), 
8.05 (m, 2 H), 7.41 (s, 1 H) , 7.25 (d, 1 H, J = 1.4 Hz), 
7.22 (m, 2 H), 7.16 (m, 2 H) , 7.12 (d, 1 H, J = 1.7 Hz), 

10 7.05 (d, 1 H, J = 1.5 Hz), 6.94 (d, 1 H, J = 1.6 Hz), 6.89 
(d, 1 H, J = 1.7 Hz) 6.87 (d, 1 H, J = 1.6 Hz), 3.93 (s, 3 
H), 3.83 (s, 3 H), 3.82 (m, 6 H) , 3.81 (s, 3 H) , 3.79 (s, 3 
H), 3.71 (d, 2 H, J = 5.1 Hz),' 3.19 (q, 2 H, J = 5.8 Hz), 
3.12 (q, 2 H, J = 5.0 Hz), 3.01 (q, 2 H, J = 4.2 Hz), 2.74 

15 (d, 6 H, J = 4.6 Hz), 2.26 (t 2 H, J = 4.6 Hz), 2.00 (s, 3 
H) , 1.75 (m, 4 H) ; MALDI -TOF-MS , 1021.6 (1021.1 calc. for 
M+H) . 
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Ac ImPyPy -y- PyPyPy - p - Dp 

2 i 



AcImPyPy-y-PyPyPy-p-Dp was prepared as described for 
ImPyPy-y-PyPyPy-p-Dp. (9.2 mg 31% yield), UV X«» (H 2 0) , 246 
5 (42,800), 312 (50,400) HPLC, r.t. 24.9, : H NMR (DMSO-d 6 ) 8 
10.25 (s, 1 H) , 10.01 (s, 1 H) , 9.92 (m, 3 H) , 9.86 (s, 1 
H), 9.3 (br s, 1 H) , 8.10 (m, 3 H) , 7.42 (s, 1 H) , 7.25 (d, 
1 H, J = 1.5 Hz) , 7.20 (d, 1 H, J = 1.6 Hz) , 7.16 (m, 3 H) , 
7.12 (d, 1 H, J = 1.4 Hz), 7.03 (d, 1 H J = 1.7), 6.89 (d, 

10 1 H, J = 1.6 Hz), 6.86 (m, 2 H) , 3.92 (s, 3 H) , 3.83 (s, 3 
H), 3.82 (s, 3 H), 3.80 (s, 6H) , 3.78 (s, 3 H) , 3.35 (q, 2 
H, J = 5.5 Hz), 3.20 (q, 2 H, J = 3.8 Hz), 3.08 (q, 2 H, J 
= 3.3 Hz), 2.97 (q, 2 H, J = 3.8 Hz), 2.75 (d, 6 H J = 4.8 
Hz), 2.34 (t, 2 H, J = 5.0 Hz), 2.24 (t, 2 H, J = 4.4 Hz), 

15 2.00 (s, 3 H) , 1.71 (m, 4 H) ; MALDI -TOF-MS , 1035.4 (1035.1 
calc. for M+H) . 
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16. Imlmlm-y-PyPyPy-p-Dp. 




The product was synthesized by manual solid phase 
protocols and recovered as a white powder (2.4 mg, 4% 
5 recovery). UV ^ ax 312 (48,500); X H NMR (DMSO-d 6 ) d 10.09 
(s, 1 H) , 9.89 (s, 1 H), 9.88 (s, 1 H) , 9,83 (s, 1 H) , 9.57 
(s, 1 H), 9.19 (br s, 1 H), 8.36 (t, 1 H, J = 5.6 Hz), 8.03 
(m, 2 H), 7.64 (s, 1 H) , 7.51 (s, 1 H) , 7.45 (s, 1 H) , 7.20 
(d, 1 H, J = 1.0 Hz), 7.15 (d, 1 H, J = 2.0 Hz), 7.14 (s, 1 

10 H) , 7.08 (s, 1 H) , 7.04 (s, 1 H) , 6.87 (d, 2 H, J = 2.2 
Hz), 4.01 (s, 3 H), 3.99 (s, 3 H) , 3.95 (s, 3 H) , 3.82 (s, 
3 H) 3.82 (s, 3 H) , 3.79 (s, 3 H) , 3.37 (q, 2 H, J = 5.8 
Hz), 3.26 (q, 2 H, J = 6.1 Hz), 3.10 (q, 2 H, J = 6.1 Hz), 
2.99 (m, 2 H) , 2.73 (d, 6 H, J = 4.8 Hz), 2.34 (t, 2 H, J = 

15 7.2 Hz), 2.27 (t, 2 H, J = 7.3 Hz), 1.79 (m, 4 H) ; MALDI- 
TOF-MS, 980.1 (980.1 calc. for M+H) . 
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17 . Imlmlm-y- PyPyPy- P -Dp-NH 2 : 




A sample of machine -synthesized resin (350 mg, 0.17 
mmol/gram 1 ) was placed in a 20 mL glass scintillation vial, 
5 and treated with 2 mL 3 , 3 " -diamino-i\7-methyldipropylamine at 
55 °C for 18 hours. 

The resin was removed by filtration through a 
disposable propylene filter, and the resulting solution 
dissolved with water to a total volume of 8 mL, and 

10 purified directly by preparatory reversed phase HPLC to 
provide ImImIm-Y-PyPyPy~P-Dp-NH 2 (28 mg, 41% recovery) as a 
white powder. X H NMR (DMSO-d*) 8 10.14 (s, 1 H) , 9.89 (s, 1 
H) , 9.88 (s, 1 H), 9.83 (s, 1 H) , 9.6 (br s, 1 H) , 9.59 (s, 
1 H) , 8.36 (t, 1 H, J = 5.5 Hz), 8.09 (t, 1 H, J" = 5.0 

15 Hz), 8.03 (t, 1 H, J = 5.0 Hz), 7.9 (br S, 3 H) , 7.63 (s, 1 
H) , 7.50 (s, 1 H) , 7.44 (s, 1 H) , 7.19 (d, 1 H, J = 1.2 
Hz), 7.13 (m, 2 H) , 7.08 (d, 1 H, J = 1.3 Hz), 7.02 (d, 1 
H, J = 1.2 Hz), 6.85 (m, 2 H) , 4.01 (s, 3 H) , 3.99 (s, 3 
H) , 3.97 (m, 6 H) , 3.80 (s, 3 H) , 3.77 (s, 3 H) , 3.34 (q, 2 

20 H, J 5.3 Hz), 3.23 (q, 2 H, J = 6.0 Hz), 3.05 (m, 6 H) , 
2.83 (q, 2 H, J = 5.0 Hz), 2.70 (d, 3 H, J = 4.0 Hz), 2.32 
(t, 2 H, J = 6.9 Hz), 2.25 (t, 2 H, J = 6.9 Hz), 1.90 (m, 2 
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H)-, 1,77 (m, 4 H) . MALDI -TOF-MS , 1022.8 (1023.1 calc. for 
M+H) . 

18 . ImPyPy-G-PyPyPy-G-Dp-NH 2 

5 Polyamide was prepared by manual solid phase methods 

as a white powder upon cleavage of 24 0 mg resin with N- 
methyl -bis (aminopropyl ) amine (2 ml, 55 °C) (19.0 mg, 44 % 

recovery after HPLC purification). 1 H NMR (DMSO-ds) 5 10.49 
(s, 1 H) , 9.97 (s, 1 H) , 9.93 (s, 1 H) , 9.91 ( S/ 1 H) , 9.89 

10 (s, 1 H), 9.7 (br s, 1 H) , 8.27 (m, 2 H) , 8.04 (t, 1 H, J 
= 5.1 Hz), 7.88 (br s, 3 H) , 7.39 (s, 1 H) , 7.27 (d, 1 H, J 
= 1.6 Hz), 7.21 (m, 3 H) , 7.15 (m, 2 H) , 7.05 (m, 2 H) , 
6.93 (m, 3 H) , 3.97 ( S/ 3 H) , 3.96 (m, 6 H) , 3.92 (m, 9 H) , 
3.72 (m, 4 H) , 3.14 (m, 6 H) , 3.05 (q, 2 H, J = 5.4 Hz), 

15 2.73 (d, 3 H, J = 3.3 Hz), 1.88 (quintet, 2 H, J = 4.6 
Hz), 1.75 (quintet, 2 H, J = 6.3 Hz). MALDI -TOF-MS, 979.0 
(979. 1 calc for M+H) . 

19 . ImPy Py - G - Py PyPy - p - Dp - NH 2 

20 Polyamide was prepared by manual solid phase methods 

as a white powder upon cleavage of 24 0 mg resin with N- 
methyl -bis (aminopropyl) amine (2 ml, 55 °C) (25 mg, 55 % 

recovery). HPLC, r.t. 22.0; X H NMR (DMSO-d6) 6 10.53 (s, 1 
H) , 10.00 (s, 1 H) , 9.98 (s, 1 H) , 9.93 (s, 1 H) , 9.92 (s, 

25 1 H) , 9.7 (br s, 1 H) , 8.31 (t, 1 H r J = 5.7 Hz), 8.12 (t, 
1 H, J = 5.5 Hz), 8.04 (t, 1 H, J = 5.6 Hz), 7.9 (br s, 3 
H) , 7.41 (s, 1 H) , 7.29 (d, 1 H, J = 1.7 Hz), 7.23 (d, 1 H, 
J = 1.5 Hz), 7.22 (d, 1 H, J = 1.4 Hz), 7.16 (m, 3 H) , 
7.07 (d, 1 H, J = 1.2 Hz), 7.03 (d, 1 H, J = 1.3 Hz), 6.94 

30 (d, 1 H, J = 1.6 Hz), 6.93 (d, 1 H, J = 1.5 Hz), 6.86 (d, 
1 H, J = 1.4 Hz), 3.98 (s, 3 H) , 3.88 (d, 2 H, J = 5.6 
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Hz), 3.83 (s, 3 H), 3.82 (m, 6 H) , 3.80 (s, 3 H) , 3.78 (s, 
3 H), 3.37 (q, 2 H, J = 6.4 Hz), 3.11 (m ( 6 H) , 2.86 (q, 2 
H, J = 6.1 Hz), 2.70 (d, 3 H, J 4.6 Hz), 2.32 (t, 2 H, J 
= 7.2 Hz), 1.87 (quintet, 2 H, J = 7.4 Hz), 1.75 (quintet, 
> 2 H, J = 6.0 Hz), MALDI-TOF-MS, 993.3 (993.1 calc for 
M+H) . 



20. ImPyPy-p-PyPyPy-G-Dp-NH 2 

Polyamide was prepared by automated solid phase 
10 methods as a white powder upon cleavage of 24 0 mg resin 
with N-methyl -bis (aminopropyl ) amine (2 ml, 55 °C) (23.0 mg, 
53 % recovery). HPLC, r.t. 20.6; X H NMR (DMSO-d 6 ) 8 10.45 
(s, 1 H), 9.95 (s, 1 H), 9.92 (m, 3 H) , 9.6 (br s, 1 H) , 
8.27 (t, 1 H, J = 4.7 Hz), 8.11 (m, 2 H) , 7.9 (s, 3 H) , 
15 7.38 (s, 1 H), 7.26 (d, 1 H, J = 1.7 Hz), 7.21 (m, 2 H) , 
7.17 (m, 2 H), 7.13 (d, 1 H, J = 1.8 Hz), 7.05 (m, 2 H) , 
6.93 (d, 1 H, J = 1.6 Hz), 6.88 (d, 1 H, J = 1.6 Hz), 6.83 
(d, 1 H, J = 1.7 Hz), 3.97 (s, 3 H) , 3.82 (s, 9 H) , 3.81 
(s, 3 H), 3.79 (s, 3 H) , 3.73 (m, 2 H) , 3.44 (q, 2 H, J = 
20 5.5 Hz), 3.2 (m, 6 H) , 2.85 (q, 2 H, J = 5.8 Hz), 2.73 (d, 
3 H, J = 4.5 Hz), 1.89 (quintet, 2 H, J = 6.4 Hz), 
1.77 (quintet, 2 H, J = 6.9 Hz) MALDI-TOF-MS, 992.9 (993.1 
calc for M+H) . 
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21. ImPyPy-y-ImPyPy-p-PyPyPy-G-Dp-NH2 




The polyamide was prepared by machine-assisted solid 
phase methods as a white powder. (29 mg 59 % recovery) . 

5 HPLC r.t. 21.5, 1 H NMR (DMS0-d6) ; 8 10.50 (s, 1 H) , 10.27 
(s, 1 H) , 9.96 (s, 1 H), 9.93 (m, 5 H) , 9.2 ( br a, 1 H) , 
8.27 (t, 1 H, J = 5.1 Hz), 8.03 (m, 3 H) , 7.90 (s, 3 H) , 
7.45 ( S< 1 H), 7.40 (s, 1 H) , 7.27 (d, 1 H, J = 1.3 Hz), 
7.25 (d, 1 H, J = 1.4 Hz), 7.22 (m, 2 H) , 7.18 (m, 2 H) , 

10 7.17 (d, 1 H, J = 1.4 Hz), 7.14 (d, 1 H, J = 1.3 Hz), 7.11 
(m, 2 H), 7.06 (d, 1 H, J = 1.5 Hz), 6.94 (d, 1 H, J = 1.3 
Hz), 6.88 (m, 2 H) , 6.84 (d, 1 H, J =1.4 Hz), 3.97 (s, 3 
H) , 3.93 (s, 3 H) , 3.83 (m, 9 H) , 3.80 (m, 6 H) , 3.76 (m, 6 
H), 3.72 (d, 2 H, J = 5.2 Hz) , 3.43 (q, 2 H, J = 5.0 Hz), 

15 3.17 (m, 6 H) , 3.11 (q, 2 H, J = 5.3 Hz), 2.85 (q, 2 H, J = 
5.2 Hz), 2.73 (d, 3 H, J = 3.9 Hz), 2.51 (t, 2 H, J = 6.5 
Hz), 2.35 (t, 2 H, J = 6.7 Hz), 1.92 (quintet, 2 H, J = 6.8 
Hz), 1.78 (m, 4 H) . MALDI-TOF MS 1445.6 (1445.6 calc for 
M+H) . 

20 
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22. ImlmlmPy - y - Py Py Py Py - P - Dp - NH2 



A sample of machine-synthesized resin (350 mg, 0.16 
mmol/gram) was placed in a 20 mL glass scintillation vial, 
5 and treated with 2 mL 3 , 3 ' -diamino-W-methyldipropylamine at 
55 °C for 18 hours. The resin was removed by filtration 
through a disposable propylene filter, and the resulting 
solution dissolved with water to a total volume of 8 mL, 
and purified directly by preparatory reversed phase HPLC to 

10 provide ImImImPy-y-PyPyPyPy-p-Dp-NH2 (31 mg, 40% recovery) 
as a white powder. X H NMR (DMS0-d 6 ) 5 10.37 (s, 1 H) , 10.16 
(s, 1 H) , 9.95 (s, 1 H) , 9.93 (s, 1 H) , 9.91 (s, 1 H) , 9.86 
(s, 1 H) , 9.49 (br s, 1 H) , 9.47 (s, 1 H) , 8.12 (m, 3 H) , 
8.0 (br s , 3 H) , 7.65 (s, 1 H) , 7.57 (s, 1 H) , 7.46 (s, 1 

15 H) , 7.20 (m, 3 H) , 7.16 (m, 2 H) , 7.09 (d, 1 H, J = 1.5 
Hz), 7.05 (m, 2 H) , 7.00 (d, 1 H, J 1.6 Hz), 6.88 (m, 2 
H), 4.01 (s, 3 H), 3.99 (s, 3 H) , 3.98 (s, 3 H) , 3.83 (s, 3 
H), 3.82 (s, 3 H), 3.81 (s, 3 H) , 3.79 (s, 3 H) , 3.78 (s, 3 
H) , 3.36 (q, 2 H, J = 5.3 Hz), 3.21- 3.05 (m, 8 H) , 2.85 

20 (q, 2 H, J = 4.9 Hz), 2.71 (d, 3 H, J = 4.4 Hz), 2.34 (t, 
2 H, J = 5.9 Hz), 2.26 (t, 2 H, J = 5.9 Hz), 1.85 
(quintet, J = 5 . 7 Hz) , 1 . 72 (m, 4 H) . MALDI-TOF-MS, 1267.1 
(1267.4 calc. for M+H) . 
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23 . ImPyPyPyPy-y-ImPyPyPyPy-p-Dp-NH2 




A sample of ImPyPyPyPy-y-ImPyPyPyPy-p-resin prepared by 
5 machine-assisted solid phase synthesis (240 mg, 0.16 
mmol/gram) was placed in a 20 mL glass scintillation vial, 
and treated with 3 , 3-diamino-N-methyldipropylamine (2 mL) 
at 55 °C for 18 hours. Resin was removed by filtration, and 
the filtrate diluted to a total volume of 8 mL with 0.1 % 
10 (wt/v) aqueous TFA. The resulting crude polyamide/amine 
solution was purified directly by reversed phase HPLC to 
provide the trif luoroacetate salt of ImPyPyPyPy-y- 
ImPyPyPyPy-p-NH2 (31 mg, 40% recovery) as a white powder. UV 
^max 241, 316 (e) 83300 (calculated based on e 

15 8,333/ring 5 ) ; X H NMR (DMSO-d^) 5 10.53 (s, 1 H) , 10.28 (s, 1 
H), 10.03 (s, 1 H) , 10.00 (s, 1 H) , 9.96 (m, 2 H) , 9.92 (m, 
2 H) , 9.6 (br s, 1 H), 8.07 (m, 4 H) , 7.89 (s, 3 H) , 7.45 
(s, 1 H), 7.41 (s, 1 H), 7.27 (d, 2 H, J= 8.5 Hz), 7.23 (m, 
4 H) , 7.16 (m, 4 H), 7.06 (m, 4 H) , 6.87 (m, 2 H) , 3.98, 

20 (s, 3 H) , 3.94 (s, 3 H) , 3.84, (m, 6 H), 3.79 (s, 3 H) , 
3.35 (q, 2 H, J= 5.7 Hz), 3.16 (m, 8 H) , 2.85 (q, 2 H, J= 
5.6 Hz), 2.72 (d, 2 H, J= 4.2 Hz), 2.34 (m, 2 H) , 1.91 (m, 
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4.H), 1.78 (m, 4 H) . MALDI-TOF MS, 1510.4 (1510.7 calc . 
for M+H) . 



24. ImImPyPyPy-y-ImPyPyPyPy-p-Dp-NH2 : 



5 




The polyamide was prepared as a white powder 



described for ImPyPyPyPy-y-imPyPyPyPy-p-NH 2 . 1 H NMR (DMSO- 
d 6 ) 5 10.39 (s, 1 H), 10.28 (s, 1 H) , 10.03 (s, 1 H) , 10.00 
(s, 1 H), 9.92 (m, 2 H) , 9.82 (s, 1 H) , 9.66 (br s, 1 H) , 

10 8.11 (m, 4 H), 7.89 (s, 3 H) , 7.57 (s, 1 H) , 7.46 (d, 2 H, 
J=2.4 Hz), 7.27 (dd, 2 H, J- 1.0 Hz) 7.23 (m, 4 H) , 7.16 
(m, 4 H) , 7.08 (m, 4 H) , 6.88 (m, 1 H) , 4.00 (s, 3 H) , 3.94 
(s, 3 H), 3.78 (s, 3 H) , 3.19 (q, 2 H, J=5.1 Hz), 3.05 (m, 
8 H), 2.86 (q, 2 H, J= 4.8 Hz), 2.72 (d, 2 H, J= 4.4 Hz), 

15 2.34 (m, 4 H) , 1.90 (m, 4 H) , 1.78 <m, 4 H) . MALDI-TOF-MS, 
1510.4 (1511.7 calc. for M+H) . 
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2 5 . Imlmlm -y- PyPyPy - p - Dp - EDTA 




O 

EDTA- di anhydride (50 mg) was dissolved in 1 mL 



DMSO/NMP solution and 1 mL DIEA by heating at 55 °C for 5 
5 min. The dianhydride solution was added to Imlmlm-y-PyPyPy- 
p-Dp-NH 2 (8.0 mg, 7 /xmol) dissolved in 750 DMSO. The 

mixture was heated at 55 °C for 25 minutes, and treated 
with 3 mL 0 . 1M NaOH, and heated at 55 °C for 10 minutes. 
0.1% TFA was added to adjust the total volume to 8 mL and 

10 the solution purified directly by preparatory HPLC 
chromatography to provide Imlmlm-y-PyPyPy-p-Dp-EDTA as a 
white powder. (3.3 mg, 3 0% recovery) X H NMR (DMSO-d 6 ) d 
10.14 (s, 1 H) , 9.90 (s, 1 H), 9.89 (s, 1 H) , 9.85 (s, 1 
H) , 9.58 (s, 1 H), 9.3 (br s, 1 H) , 8.40 (m, 2 H) , 8.02 (m, 

15 2 H), 7.65 (s, 1 H), 7.51 (s, 1 H) , 7.45 (s, 1 H) , 7.20 (d, 
1 H, J = 1.5 Hz), 7.15 (m, 2 H) , 7.08 (d, 1 H, J = 1.1 
Hz), 7.04 (d, 1 H, J 1.5 Hz), 6.86 (m, 2 H) , 4.00 (s, 3 
H) , 3.98 (s, 3 H) , 3.94 (s, 3 H) , 3.87 (m, 4 H) , 3.82 (s, 3 
H) , 3.81 (s, 3 H) , 3.78 (s, 3 H) , 3.72 (m, 4 H) , 3.4-3.0 

20 (m, 16 H) , 2.71 (d, 3 H, J = 4.2 Hz), 2.33 (t, 2 H, J = 5.1 
Hz), 2.25 (t, 2 H, J = 5.9 Hz), 1.75 (m, 6 H) . MALDI -TOF- 
MS, 1298.4 (1298.3 calc . for M+H) . The polyamide was loaded 
with Fe(II) by standard methods. 
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26. ImPyPy-y-ImPyPy-P-Dp 




The polyamide was prepared by machine-assisted solid 
phase methods as a white powder. (17 mg, 56% recovery) . 
5 HPLC r.t. 26.1, UV ^ (e) , 234 (39,300), 312 (53,200) nm; 
X H NMR (DMS0-d 6 ) ; d 10.53 (s, 1 H) , 10.27 (s, 1 H) , 10.04 
(s, 1 H) , 9.96 (s, 1 H) , 9.94 (s, 1 H) , 9.2 (br s, 1 H) , 
8.08 (m, 3 H) , 7.49 (s, 2 H) , 7.44 (s, 1 H) , 7.31 (d, 1 H, 
J = 1.0 Hz), 7.23 (d, 1 H, J = 1.1 Hz), 7.19 (m, 3 H) , 7.10 

10 (s, 1 H) , 6.92 (d, 1 H, J = 1.1 Hz), 6.90 (d, 1 H, J = 1.1 
Hz). 4.01 (s, 3 H) , 3.97 (s, 3 H) , 3.86 (m, 6 H) , 3.82 (m, 
6 H) , 3.41 (q, 2 H, J = 6.0 Hz), 3.22 (q, 2 H, J = 5.9 Hz), 
3.13 (q, 2 H, J = 5.9 Hz), 3.0 (q, 2 H, J = 5.6 Hz), 2.76 
(d, 6 H, J = 4.8 Hz), 2.37 (m, 4 H) , 1.78 (m, 4 H) ; MALDI - 

15 TOF MS 979.3 (979.1 calc. for M+H) . 



27. ImPyPy-G-PyPyPy-G-Dp 
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Polyamide was prepared by manual solid phase methods 
and obtained as a white powder upon cleavage of 240 mg 
resin, (initial subsitution of 0.2 mmol Boc-Glycine/gram) 
with dimethylaminopropylamine (11.9 mg, 29% recovery). 

5 HPLC, r.t. 26.9 min.; UV U. (e) . 246 (41,100), 312 (48,400) 
nm; X H NMR (DMSO-d 6 ) 5 10.49 (s, 1 H) , 9.98 (s, 1 H) , 9.95 
(s, 1 H) , 9.92 (s, 1 H) , 9.89 (s, 1 H) , 9.2 (br s, 1 H) , 
8.30 (m, 2 H) , 8.06 (t, 1 H, J" = 5.8 Hz), 7.40 (s, 1 H) , 
7.24, (d, 1 H, J = 1.7 Hz), 7.23 (m, 3 H) , 7.17 (m, 2 H) , 

0 7.06 (m, 2 H) , 6.94 (m, 3 H) , 3.99 (s, 3 H) , 3.89 (d, 2 H, 
J = 5.8 Hz), 3.84 (s, 3 H) , 3.84 (s, 3 H) , 3.83 (s, 3 H) , 
3.81 (s, 3 H) , 3.80 (s, 3 H) , 3.72 (d, 2 H, J = 4.3 Hz), 
3.13 (q, 2 H, J = 5.7 Hz), 3.01 (q, 2 H, J = 5.2 Hz), 2.76 
(d, 6 H, J = 4.3 Hz), 1.77 (quintet, 2 H, J = 7.4 Hz); 

5 MALDI-TOF MS 935.7 (M + H 936.0 calc for C 44 H 55 N 16 0 8 ) ; FABMS 
m/e 935.433 (M + H 935.439 calcd. for C 44 H 55 N 16 0 8 ) . 

28. ImPyPy-G-PyPyPy-p-Dp 

1 ° vv 



20 




N 

" 6 

H 

o o 



Polyamide was prepared by manual solid phase methods 
as a white powder upon cleavage of 180 mg resin (initial 
subsitution of 0.2 mmol Boc-P-alanine/gram) with 
dimethylaminopropylamine (2 ml, 55 °C) . (12.3 mg, 38 % 
recovery after HPLC purification). HPLC, r.t. 25.5, UV 
25 (8), 246 (39,500), 312 (52,000) nm; X H NMR (DMSO-d 6 ) / 10.46 
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(s, 1 H) , 9.96 (s, 1 H) , 9.90 (s, 1 H) , 9.88 (m, 2 H) , 9.21 
(br s, 1 H) , 8.27 (t, 1 H, J = 4.2 Hz), 8.06 (m, 2 H) , 7.39 
(s, 1 H) , 7.28 (d, 1 H, J = 1.6 Hz), 7.23 (d/ 1 H, J = 1.7 
Hz), 7.20 (d, 1 H, J = 1.5 Hz), 7.15 (m, 3 H) , 7.04 (m, 2 
5 H), 7.03 (d, 1 H, J = 1.6 Hz), 6.94 (d, 1 H, J = 1.7 Hz), 
6.92 (d, 1 H, J = 1.4 Hz), 3.98 (s, 3 H) , 3.88 (d, 2 H, J 
= 5.6 Hz), 3.83 (s, 3 H) , 3.82 (m, 6 H) , 3.79 (s, 3 H) , 
3.78 (s, 3 H) , 3.36 (q, 2 H, J = 5.3 Hz), 3.09 (q, 2 H, J = 
6.0 Hz), 2.75 (q, 2 H, J = 5.2 Hz), 2.72 (d, 6 H, J - 4.8 
10 Hz), 2.30 (t, 2 H, J = 6.3 Hz), 1.72 (quintet, 2 H, J = 5.7 
Hz) MALDI-TOF MS 950.1 (950.0 calc for M + H) ; FABMS m/e 
949.462 (M +H 949.455 calc. for C 4 5H 5 7N 16 0 8 ) . 

29 . ImPy Py - P - PyPy Py - G - Dp 

15 Polyamide was prepared by automated solid phase 

methods as a white powder upon cleavage of 180 mg resin 
(initial subsitution of 0.2 mmol Boc- Glycine /gram) with 
dimethyl aminopropyl amine (2 ml, 55 °C) (17.2 mg, 57 % 
recovery after HPLC purification). HPLC, r.t. 26.5; UV X^a X 

20 (8), 246 (46,500), 312 (54,800) nm; *H NMR (DMS0-d 6 ) 8 10.54 
(s, 1 H) , 9.92 (s, 1 H) , 9.90 (m, 3 H) , 9.23 (br s, 1 H) , 
8.27 (t, 1 H, J = 5.5 Hz), 8.06 (t, 1 H, J = 6.3 Hz), 8.03 
(t, 1 H, J = 6.2 Hz), 7.39 (s, 1 H) , 7.26 (d, 1 H, J = 1.7 
Hz), 7.20 (m, 2 H) , 7.17 (m, 3 H) , 7.13 (m, 2 H) , 7.04 (d, 

25 1 H, J = 1.5 Hz), 6.87 (d, 1 H, J = 1.8 Hz), 6.83 (d, 1 H, 
J = 1.8 Hz), 3.97 (s, 3 H) , 3.82 (m, 15 H) , 3.78 (d, 2 H, J 
= 3.4 Hz), 3.27 (m, 4 H) , 3.15 (m, 2 H) , 3.79 (m, 2 H) , 
2.76 (d, 6 H, J = 4.9 Hz), 1.78 (quintet, 2 H, J = 6.6 Hz) 
MALDI-TOF MS 950.2 (950.0 calc. for M + H) ; FABMS m/e 

30 949.458 (M + H 949.455 calc. for C 4 5H57N 16 0 8 ) . 
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30. ImPyPy-P-PyPyPy-p-Dp 

Polyamide was prepared by automated solid phase methods 
as a white powder upon cleavage of 240 mg resin (initial 
subsitution of 0.2 mmol Boc-p-alanine/gram) with 
> dimethylaminopropylamine (2 ml, 55 °C) . (19.0 mg, 43 % 
recovery after HPLC purification). HPLC, r.t. 26.8; UV X^ x 
(e) , 246 (42, 100), 312 (53,900) nm; X H NMR (DMSO-d 6 ) 5 10.56 
(s, 1 H), 9.90 (s, 1 H), 9.89 (m, 2 H) , 9.87 (s, 1 H) , 9.21 
(br s, 1 H) , 8.24 (t, 1 H, J = 5.2 Hz) , 8 . 04 . (t, 1 H, J = 
6.1 Hz), 8.01 (t, 1 H, J = 6.0 Hz), 7.35 (s, 1 H) , 7.26 (d, 
1 H, J = 1.6 Hz), 7.23 (m, 3 H) , 7.16 (m, 3 H) , 7.12 (m, 1 
H) , 7.02 (d, 1 H, J = i.s Hz) , 6.85 (d, 1 H, J = 1.9 Hz), 
6.80 (d, 1 H, J = 1.8 Hz), 3.96 (s, 3 H) , 3.79 (s, 3 H) , 
3.78 (s, 3 H), 3.36 (q, 2 H, J = 5.3 Hz), 3.09 (q, 2 H, J = 
6.0 Hz), 2.75 (q, 2 H, J = 5.0 Hz), 2.72 (d, 6 H, J = 4.7 
Hz), 2.30 (t, 2 H, J = 6.1 Hz), 1.72 (quintet, 2 H, J = 5.5 
Hz); MALDI-TOF MS 964.2 (964.1 calc. for M + H) 



31. ImPy Py - Py - PyPyPy - G - Dp 

20 Polyamide was prepared by manual solid phase methods. 

Recovery is based on cleavage of 18 0 mg resin (initial 
subsitution of 0.2 mmol Boc-Glycine/gram) with 
dimethylaminopropylamine (2 ml, 55 °C) . (8 mg, 24% recovery 
after HPLC purification). A small quantity of the failure 

25 heptamide AcPyPyPyPyPyPy-Dp was found in the initial 
preparation and was removed by a second preparatory HPLC 
purification to afford pure ImPyPy-Py-PyPyPy-G-Dp as a 
white powder (1.2 mg) . HPLC, r.t. 28.5, UV U (e) , 246 
(34,600), 312 (55,300); X H NMR (DMSO-d 6 ) 5 10.55 (s, 1 H) , 

30 10.02 (s, 1 H) , 10.00 (m, 4 H) , 9.3 (br s, 1 H) , 8.32 (t, 1 
H, J = 6.2 Hz), 8.06 (t, 1 H, J = 5.9 Hz), 7.44 (s, 1 H) , 
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7,31 (d, 1 H, J = 1.7 Hz), 7.26 (m, 5H) , 7.19 (d, 1 H, J 
= 1.8 Hz), 7.10 (m, 5 H), 6.97 (d, 1 H, J 1.7 Hz), 4.01 

(s, 3 H), 3.87 (m, 15 H) , 3.82 (s, 3 H) , 3.73 "(d, 2 H, J = 

5.5 Hz), 3.16 (q, 2 H, J = 6.2 Hz), 3.03 (q, 2 H, J = 

5 5.2 Hz), 2.74 (d, 6 H, J = 4.9 Hz), 1.77 (quintet, 2 H, J 

= 6.7 Hz); MALDI-TOF MS 1000.5; FABMS jn/e 1001.471 (M + H 
1001.473 calcd. for C 48 H 59 N 17 0 8 ) . 
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32. ImPyPy-Py-PyPyPy-p-Dp 

Polyamide was prepared by machine assisted solid phase 
synthesis to afford a white powder upon cleavage of 800 mg 
resin (initial subsitution of 0.2 mmol Boc-p-alanine/gram) 
with dimethylaminopropylamine (2 ml, 55 °C) . (56 mg, 36 % 
recovery after HPLC purification) (e) 246 (34,800), 308 
(57,000); HPLC r.t. 27.9 min.; 2 H NMR (DMSO-d 6 ) 5 10.47 (s, 
1 H), 9.95 (m, 4 H) , 9.89 (s, 1 H) , 9.2 (br s, 1 H) , 8.03 
(m, 2 H), 7.39 (s, 1 H) , 7.27 (d, 1 H, J = 1.3 Hz), 7.22 
(m, 4 H) , 7.15 <m, 2 H) , 7.07 (m, 4 H) , 7.03 (d, 1 H, J = 
1.4 Hz), 6.86 (d, 1 H, J = 1.0 Hz), 3.97 (s, 3 H) , 3.84 (m, 
12 H), 3.82 (s, 3 H), 3.77 (s, 3 H) , (P-ala quartet covered 
by water.), 3.11 (q, 2 H, J = 5.1 Hz) , 3.08 (q, 2 H, J = 
6.0 Hz), 2.72 (d, 6 H, J = 4.8 Hz), 2.34 (t, 2 H, J = 4.4 
Hz), 1.7 (m, 2 H) ; MALDI -TOF-MS , 1014.7 (1015.1 calc for 
M+H) . 

33. ImPyPy-G-PyPyPy-G-Dp 

Polyamide was prepared by manual solid phase methods 
as a white powder upon cleavage of 240 mg resin with W- 
methyl-bis(aminopropyl) amine (2 ml, 55 °C) (19.0 mg, 44 % 
recovery after HPLC purification). X H NMR (DMSO-d 6 ) 8 10.49 
(s, 1 H), 9.97 (s, 1 H), 9.93 (s, 1 H) , 9.91 (s, 1 H) , 9.89 
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(s, 1 H), 9.7 (br s, 1 H), 8.27 (m, 2 H) , 8.04 (t, 1 H, J 
= 5.1 Hz), 7.88 (br s, 3 H) , 7.39 (s, 1 H) , 7.27 (d, 1 H, J 
= 1.6 Hz), 7.21 (m, 3 H) , 7.15 (m, 2 H) , 7.05 (m, 2 H) , 
6.93 (m, 3 H), 3.97 (s, 3 H) , 3.96 (m, 6 H) , 3.92 (m, 9 H) , 
5 3.72 (m, 4 H), 3.14 (m, 6 H) , 3.05 (q, 2 H, J = 5.4 Hz), 
2.73 (d, 3 H, J = 3.3 Hz), 1.88 (quintet, 2 H, J = 4.6 
Hz), 1.75 (quintet, 2 H, J = 6.3 Hz). MALDI -TOF-MS, 979.0 
(979. 1 calc for M+H) . 

10 34. ImPyPy-G-PyPyPy-p-Bp 

Polyamide was prepared by manual solid phase methods 
as a white powder upon cleavage of 240 mg resin with N- 
methyl -bis (aminopropyl) amine (2 ml, 55 °C) (25 mg, 55 % 
recovery). HPLC, r.t. 22.0; *H NMR (DMS0-d 6 ) 5 10.53 (s, 1 

15 H), 10.00 (s, 1 H), 9.98 (s, 1 H) , 9.93 (s, 1 H) , 9.92 (s, 
1 H), 9.7 (br s, 1 H), 8.31 (t, 1 H, J = 5.7 Hz), 8.12 (t, 
1 H, J = 5.5 Hz), 8.04 (t, 1 H, J = 5.6 Hz), 7.9 (br s, 3 
H), 7.41 (s, 1 H), 7.29 (d, 1 H, J = 1.7 Hz), 7.23 (d, 1 H, 
J = 1.5 Hz), 7.22 (d, 1 H, J = 1.4 Hz), 7.16 (m, 3 H) , 

20 7.07 (d, 1 H, J = 1.2 Hz), 7.03 (d, 1 H, J = 1.3 Hz), 6.94 
(d, 1 H, J = 1.6 Hz), 6.93 (d, 1 H, J = 1.5 Hz), 6.86 (d, 

1 H, <J = 1.4 Hz), 3.98 (s, 3 H) , 3.88 (d, 2 H, J = 5.6 
Hz), 3.83 (s, 3 H), 3.82 (m, 6 H) , 3.80 (s, 3 H) , 3.78 (s, 
3 H), 3.37 (q, 2 H, J = 6.4 Hz), 3.11 (m, 6 H) , 2.86 (q, 2 

25 H, J = 6.1 Hz), 2.70 (d, 3 H, J = 4.6 Hz), 2.32 (t, 2 H, J 
= 7.2 Hz), 1.87 (quintet, 2 H, J = 7.4 Hz), 1.75 (quintet, 

2 H, J = 6.0 Hz), MALDI -TOF-MS, 993.3 (993.1 calc for 
M+H) . 
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35. ImPyPy-P-PyPyPy-G-Dp 

Polyamide was prepared by automated solid phase 
methods as a white powder upon cleavage of 24 0 mg resin 
with W-methyl-bis(aminopropyl) amine (2 ml, 55 °C) (23.0 mg, 
S 53 % recovery). HPLC, r.t. 20.6; X H NMR (DMS0-d 6 ) 5 10.45 
(s, 1 H), 9.95 (s, 1 H), 9.92 (m, 3 H) , 9.6 (br s, 1 H) , 
8.27 (t, 1 H, J = 4.7 Hz), 8.11 (m, 2 H) , 7.9 (s, 3 H) , 
7.38 (s, 1 H), 7.26 (d, 1 H, J = 1.7 Hz), 7.21 (m, 2 H) , 
7.17 (m, 2 H), 7.13 (d, 1 H, J = 1.8 Hz), 7.05 (m, 2 H) , 
6.93 (d, 1 H, J = 1.6 Hz), 6.88 (d, 1 H, J = 1.6 Hz), 6.83 
(d, 1 H, J = 1.7 Hz), 3.97 (s, 3 H) , 3.82 (s, 9 H) , 3.81 
(s, 3 H), 3.79 (s, 3 H) , 3.73 (m, 2 H) , 3.44 (q, 2 H, J = 
5.5 Hz), 3.2 (m, 6 H) , 2.85 (q, 2 H, J = 5.8 Hz), 2.73 (d, 
3 H, J = 4.5 Hz), 1.89 (quintet, 2 H, J = 6.4 Hz), 
1.77 (quintet, 2 H, J = 6.9 Hz) MALDI-TOF-MS, 992.9 (993.1 
calc for M+H) . 



20 



25 



30 



36. ImPyPy-G - Py PyPy - G - Dp - EDTA 

EDTA-dianhydride (50 mg) was dissolved in 1 mL DMSO/NMP 
solution and 1 mL DIEA by heating at 55 °C for 5 min. The 
dianhydride solution was added to ImPyPy-G-PyPyPy-G-Bp 
(12.0 mg, 11 nmol) dissolved in 750 fih DMSO. The mixture 
was heated at 55 °C for 25 minutes, and treated with 3 mL 
0.1M NaOH, and heated at 55 °C for 10 minutes. 0.1% TFA was 
added to adjust the total volume to 8 mL and the solution 
purified directly by preparatory HPLC chromatography to 
provide I mPy Py - G - Py Py Py - G - Bp - EDTA as a white powder. (4.7 
mg, 31% recovery after HPLC purification); HPLC, r.t. 28.8; 
*H NMR (DMS0-d 6 ) 6 10.49 (s, 1 H) , 9.97 (s, 1 H) , 9.91 (s, 1 
H) , 9.89 (m, 2 H) , 9.4 (br s, 1 H) , 8.42 (t, 1 H, J = 5.0 
Hz), 8.31 (t, 1 H, J = 5.5 Hz), 8.00 (m, 2 H) , 7.38 (s, 1 
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H) , 7.26 (d, 1 H, J = 1.5 Hz), 7.22 (d, 1 H, J = 1.4 Hz), 
7.20 (d, 1 H, J = 1.4 Hz), 7.14 (m, 3 H) , 7.03 (m, 2 H) , 
6.92 (d, .1 H, J = 1.5 Hz), 3.95 (s, 3 H) , 3.85 (m, 4 H) , 
3.84 (s, 3 H), 3.80 (m, 6 H) , 3.78 (s, 3 H) , 3.76 (s, 3 H) , 
5 3.69 (m, 6 H) , 3.55 (q, 2 H, J = 5.7 Hz), 3.3-3.0 (m, 12 
H), 2.69 (d, 3 H, J = 3.9 Hz), 2.31 (t, 2 H, J = 6.8 Hz), 
1.73 (m, 4 H) ; MALDI -TOF-MS , 1254.8 (1254.3 calc for M+H) . 

3 7 . ImPyPy- G - PyPyPy - P - Bp - EDTA 

10 Polyamide was prepared from ImPyPy-G-PyPyPy-p-Bp (20 

mg) as described for ImPyPy-G-PyPyPy-G-Bp-EDTA. (13.0 mg, 
55 % recovery after HPLC purification). HPLC, r.t. 27.3; X H 
NMR (DMSO-d 6 ) 6 10.49 (s, 1 H) , 9.97 (s, 1 H) , 9.91 (s, 1 
H) , 9.89 (m, 2 H) , 9.4 (br s, 1 H) , 8.42 (t, 1 H, J = 5.0 

15 Hz), 8.31 (t, 1 H, J = 5.5 Hz), 8.00 (m, 2 H) , 7.38 (s, 1 
H) , 7.26 (d, 1 H, J = 1.5 Hz), 7.22 (d, 1 H, J = 1.4 Hz), 

7.20 (d, 1 H, J = 1.4 Hz), 7.14 (m, 3 H) , 7.03 (m, 2 H) , 
6.92 (d, 1 H, J = 1.5 Hz), 3.95 (s, 3 H) , 3.85 (m, 4 H) , 
3.84 (s, 3 H) , 3.80 (m, 6 H) , 3.78 (s, 3 H) , 3.76 (s, 3 H) , 

20 3.69 (m, 6 H) , 3.55 (q, 2 H, J = 5.7 Hz), 3.3-3.0 (m, 12 
H) , 2.69 (d, 3 H, J = 3.9 Hz), 2.31 (t, 2 H, J = 6.8 Hz), 
1.73 (m, 4 H) ; MALDI -TOF-MS , 1268.5 (1268.3 calc for M+H) . 

38. ImPyPy-p-PyPyPy-G-Bp-EDTA 

25 Polyamide was prepared from ImPyPy-P-PyPyPy-G-Bp (12 

mg) as described for ImPyPy-G-PyPyPy-G-Bp-EDTA. (6 mg, 42 % 
recovery after HPLC purification). HPLC, r.t. 28.0; X H NMR 
(DMSO-d 6 ) 5 10.46 (s, 1 H) , 9.95 (s, 1 H) , 9.93 (m, 3 H) , 
9.9 (br s, 1 H) , 8.43 (t, 1 H, J = 5.1 Hz), 8.28 (t, 1 H, 

30 J = 5.3 Hz), 8.03 (m, 2 H) , 7.38 (s, 1 H) , 7.26 (m, 2 H) , 

7.21 (d, 1 H, J = 1.6 Hz), 7.17 (d, 1 H, J = 1.8 Hz), 7.12 
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(d, 1 H, J = 1.8 Hz), 7.10 (s, 1 H) , 7.04 (d, 1 H, J = 1.6 
Hz), 6.93 (m, 2 H) , 6.88 (d, 1 H, J = 1.6 Hz), 6.84 (d, 1 
H, J = 1.4 Hz), 3.97 (s, 3 H) , 3.87 (m, 4 H)', 3.82 (m, 9 
H), 3.79 (s, 3 H), 3.78 (s, 3 H) , 3.68 (m, 6 H) , 3.3-2.9 
5 (m, 16 H), 2.71 (d, 3 H, J = 4.1 Hz), 1.78 (m, 4 H) ; 
MALDI -TOF-MS , 1268.9 (1268.3 calc for M+H) . 



39. ImPy Py -y - ImPyPy - [3 - Py PyPy - G - Dp 




10 

The polyamide was prepared by machine-assisted solid 
phase methods ' as a white powder. (12 rag 19 I recovery). 
HPLC r.t. 29.5, UV ^ ax (s) , 238 (53,900), 312 (71,100) nm; 
X H NMR (DMSO-d 6 ); d 10.46 (s, 1 H) , 10.24 (s, 1 H) , 9.96 (s, 

15 1 H), 9.90 (m, 5 H) , 9.2 ( br s, 1 H) , 8.25 (m, 1 H) , 8.00 
(m, 3 H), 7.44 (s, 1 H) , 7.39 (s, 1 H) , 7.26 (d, 1 H, J = 
1.3 Hz), 7.24 (d, 1 H, J = 1.5 Hz), 7.20 (m, 2 H) , 7.16 (m, 
2 H) , 7.13 (m, 2 H) , 7.11 (d, 1 H, J = 1.4 Hz), 7.05 (d, 1 
H, J = 1.4 Hz), 7.03 (d, 1 H, J = 1.5 Hz), 6.93 (d, 1 H, J 

20 = 1.3 Hz), 6.87 (m, 2 H) , 6.84 (d, 1 H, J = 1.5 Hz), 3.97 
(s, 3 H), 3.92 (s, 3 H) , 3.82 (m, 9 H) , 3.79 (m, 6 H) , 3.76 
(m, 6 H) , 3.73 (m, 2 H) , 3.44 (q, 2 H, J = 5.0 Hz), 3.17 
(m, 4 H) , 3.03 (m, 2 H) , 2.74 (d, 6 H, J = 4.8 Hz), 2.50 
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(m, 2 H) 2.33 (t, 2 H, J = 6.7 Hz), 1.77 (m, 4 H) . MALDI- 
TOF MS 1402.2 (1402.5 calc for M+H) . 



40. ImPy Py -y - ImPy Py- p - Py Py Py - G - Dp -NH 2 

9, i 



5 




The polyamide was prepared by machine-assisted solid 
phase methods as a white powder. (29 mg 59 % recovery) . 
HPLC r.t. 21.5, X H NMR (DMSO-d 6 ) ; 8 10.50 (s, 1 H) , 10.27 
(s, 1 H), 9.96 (s, 1 H), 9.93 (m, 5 H) , 9.2 ( br s, 1 H) , 

10 8.27 (t, 1 H, J = 5.1 Hz), 8.03 (m, 3 H) , 7.90 (s, 3 H) , 
7.45 (s, 1 H), 7.40 (s, 1 H) , 7.27 (d, 1 H, J = 1.3 Hz), 
7.25 (d, 1 H, J = 1.4 Hz), 7.22 (m, 2 H) , 7.18 (m, 2 H) , 
7.17 (d, 1 H, J = 1.4 Hz), 7.14 (d, 1 H, J = 1.3 Hz), 7.11 
(m, 2 H), 7.06 (d, 1 H, J = 1.5 Hz), 6.94 (d, 1 H, J = 1.3 

15 Hz), 6.88 (m, 2 H) , 6.84 (d, 1 H, J = 1.4 Hz), 3.97 (s, 3 
H) , 3.93 (s, 3 H) , 3.83 (m, 9 H) , 3.80 (m, 6 H) , 3.76 (m, 6 
H) , 3.72 (d, 2 H, J = 5.2 Hz), 3.43 (q, 2 H, J = 5.0 Hz), 
3.17 (ra, 6 H) , 3.11 (q, 2 H, J = 5.3 Hz), 2.85 (q, 2 H, J = 
5.2 Hz), 2.73 (d, 3 H, J = 3.9 Hz), 2.51 (t, 2 H, J = 6.5 

20 Hz), 2.35 (t, 2 H, J = 6.7 Hz), 1.92 (quintet, 2 H, J = 6.8 
Hz), 1.78 (m, 4 H) . MALD1-TOF MS 1445.6 (1445.6 calc for 
M+H) . 
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EDTA-dianhydride (50 mg) was dissolved in 1 mL 
DMSO/NMP solution and 1 mL DIEA by heating at 55 °C for 5 
min. The dianhydride solution was added to ImPyPy-y-imPyPy- 
P-PyPyPy-G-Dp-NH 2 (9.0 mg, 5 fimol) dissolved in 750 nh DMSO. 
The mixture was heated at 55 °C for 25 minutes, and treated 
with 3 mL 0.1M NaOH, and heated at 55 °C for 10 minutes. 
0.1% TFA was added to adjust the total volume to 8 mL and 
the solution purified directly by reversed-phase HPLC to 
provide ImPyPy-y-lmPyPy-p-PyPyPy-G-Dp-EDTA as a white 
powder. (3 mg, 30% recovery after HPLC purification); 
MALDI-TOF MS 1720.1 (1719.8 calc for M+H) . 

42. Ac-PylmPy-y-ImPyPy-p-PyPyPy-p-Dp 

O | 

1 o o 
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The polyamide was prepared by machine assisted solid 
phase methods (29) as a white powder (5 mg, 20 % recovery) . 
UV (H20) X^ ax 242 nm, 310 nm ( e = 75,000, calculated based 
on e = 8333/ring (30)); 1H NMR (DMSO-d6) : 6 10.27 (m, 2 H) ; 

5 10.02 (s, 1 H) ; 9.99 (s, 1 H) , 9.92 (m, 3 H) , 9.90 (s, 1 
H) , 9.85 (s, 1 H) , 8.6 (br s, 1 H) , 8.08 (m, 4 H) , 7.52 (s, 
1 H) ; 7.43 (S, 1 H) ; 7.24 (m, 2 H) , 7.22 (d, 1 H, J = 1.7 
Hz), 7.20 (d, 1 H, J = 1.6 Hz), 7.19 (d, 1 H, J = 1.6 Hz), 
7.14 (d, 1 H, J = 1.5 Hz), 7.11 (d, 1 H, J = 1.6 Hz), 7.07 

10 (m, 2 H) , 7.02 (d, 1 H, J = 1.4 Hz), 6.96 (d, 1 H, J = 1.7 
Hz), 6.90 (m, 2 H) , 6.88 (d, 1 H, J = 1.8 Hz), 6.83 (d, 1 
H, J = 1.6 Hz), 3.94 (s, 3 H) , 3.92 (s, 3 H) , 3.81 (m, 12 
H) , 3.79 (s, 3 H), 3.78 (s, 3 H) , 3.78 (s, 3 H) , 3.52 (m, 4 
H) , 3.33 (m, 6 H) , 2.95 (m, 2 H) , 2.71 (d, 6 H, J = 4.7 

15 Hz), 2.32 (m, 4 H) , 1.94 (s, 3 H) , 1.73 (m, 4 H) . MALDI-TOF 
MS; 1472.1 (1472.5 calc for M+H) . 



43. DM-y-PylmPy-y-ImPyPy-P-PyPyPy-p-Dp 

9 i 




20 The polyamide was prepared by machine assisted solid 

phase methods as a white powder (13 mg, 52 % recovery) . UV 

(H20) Xmax 242 nm, 310 nm (e = 75,000, calculated based on e 
= 8333/ring (30) ) ; 1H NMR (DMSO-d6) : 8 10.28 (s, 1 H) ; 10.26 

(s, 1 H) , 9.99 (s, 1 H) , 9.96 (s, 1 H) , 9.94 (s, 1 H) , 
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9.90 (m, 3 H), 9.88 (s, 1 H) , 9.3 (br s, 1 H) , 9.2 (br s, 1 
H), 8.05 (m, 4 H), 7.52 (s, 1 H) ; 7.43 (s, 1 H) ; 7.27 (d, 1 
H, J = 1.6 Hz), 7.24 (d, 1 H, J = 1.7 Hz), 7.20 (d, 1 H, J 
= 1.6 Hz), 7.17 (m, 4 H) , 7.14 (d, 1 H, J = 1.6 Hz), 7.12 
5 (d, 1 H, J = 1.5 Hz), 7.03 (d, 1 H, J = 1.6 Hz), 6.96 (d, 1 
H, J = 1.6 Hz), 6.90 (d, 1 H, J = 1.5 Hz), 6.86 (m, 2 H) , 
3.94 (s, 3 H) , 3.92 (s, 3 H) , 3.81 (m, 12 H) , 3.78 (m, 9 
H), 3.56 (m, 4 H) , 3.39 (m, 6 H) , 2.95 (m, 4 H) , 2.76 (d, 6 
H, J = 4.6 Hz), 2.71 (d, 6 H, J = 4.6 Hz), 2.30 (m, 4 H) , 
10 1.88 (m, 2 H) , 1.73 (m, 4 H) . MALDI-TOF MS; 1543.3 (1543.6 
calc for M+H) . 



44 . DM-y-ImPyPy-y-ImPyPy-p.pyPyPy.p-pypypy.p_Dp 




The polyamide was prepared by machine assisted solid 
phase methods as a white powder (3 mg, 10 % recovery) . UV 
(H20) A-a* 23 9 nm, 308 nm (e = 100,000, calculated based on e 
= 8333/ring) ; MALDI-TOF MS; 1981.3 (1981.1 calc for M+H). 
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45. ImPyPyPy-y-ImPyPyPy-p-Dp 



5 mg, 56% recovery). HPLC, r.t.: 26.1 min; UV, (e) : 234 

nm (39,300), 312 nm (53,200); *H NMR (DMSO-d 6 ) : 5 10.53 (s, 
1 H), 10.27 (s, 1 H) , 10.04 (s, 1 H) , 9.96 (s, 1 H) , 9.94 
(s, 1 H) , 9.2 (br s, 1 H) , 8.08 (m, 3 H) , 7.49 (s, 2 H) , 
7.44 (s, 1 H) , 7.31 (d, 1 H, J = 1.0 Hz), 7.23 (d, 1 H, J = 

10 1.1 Hz), 7.19 (m, 3 H) , 7.10 (s, 1 H) , 6.92 (d, 1 H, J = 
1.1 Hz), 6.90 (d, 1 H, J = 1.1 Hz). 4.01 (s, 3 H) , 3.97 
(s, 3 H) , 3.86 (m, 6 H) , 3.82 (m, 6 H) , 3.41 (q, 2 H, J = 
6.0 Hz), 3.22 (q, 2 H, J = 5.9 Hz), 3.13 (q, 2 H, J = 5.9 
Hz), 3.0 (q, 2 H, J = 5.6 Hz), 2.76 (d, 6 H, J = 4.8 Hz), 

15 2.37 (m, 4 H) , 1.78 (m, 4 H) ; MALDI-TOF MS: 1223.4 (1223.3 
calc. for M+H) . 
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46. ImPyPyPy-y-PyPyPyPy-p-Dp: 




The polyamide ImPyPyPy-y-PyPypyPy-p-Dp was prepared by 
machine-assisted solid phase methods as a white powder (12 
5 mg, 19 % recovery). HPLC, r.t.: 29.5 min; UV, ( 8 ) : 238 

nm (53,900), 312 nm (71,100); *H NMR (DMSO-d 6 ) : 5 10.46 (s, 
1 H) , 10.24 (s, 1 H), 9.96 (s, 1 H) , 9.90 (m, 5 H) , 9.2 ( 
br s, 1 H) , 8.25 (m, 1 H) , 8.00 (m, 3 H) , 7.44 (s, 1 H) , 
7.39 (s, 1 H), 7.26 (d, 1 H, J = 1.3 Hz), 7.24 (d, 1 H, J = 

10 1.5 Hz), 7.20 (m, 2 H) , 7.16 (m, 2 H) , 7.13 (m, 2 H) , 7.11 
(d, 1 H, J = 1.4 Hz), 7.05 (d, 1 H, J = 1.4 Hz), 7.03 (d, 1 
H, J = 1.5 Hz), 6.93 (d, 1 H, J = 1.3 Hz), 6.87 (m, 2 H) , 
6.84 (d, 1 H, J = 1.5 Hz), 3.97 (s, 3 H) , 3.92 (s, 3 H) , 
3.82 (m, 9 H), 3.79 (m, 6 H) , 3.76 (m, 6 H) , 3.73 (m, 2 H) , 

15 3.44 (q, 2 H, J = 5.0 Hz), 3.17 (m, 4 H) , 3.03 (m, 2 H) , 
2.74 (d, 6 H, J = 4.8 Hz), 2.50 (m, 2 H) 2.33 (t, 2 H, J = 
6.7 Hz), 1.77 (m, 4 H) ; MALDI-TOF MS: 1222.3 (1222.3 calc 
for M+H) . 
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4 7 . ImlmlmPy - y - Py Py Py Py - P - Dp : 



10 



15 




The product was synthesized by manual solid phase 
protocols and recovered as a white powder {7.6 mg, 11% 
recovery). UV X^, 248 (42,000), 312 (48,500); X H NMR 
(DMS0-d 6 ) d 10.32 (s, 1 H) , 10.13 (s, 1 H) 9.93 (s, 1 H) , 
9.90 (s, 1 H), 9.89 (s, 1 H) , 9.84 (s, 1 H) , 9.59 (s, 1 H) , 
9.23 (br s, 1 H) , 8.09 (t, 1 H, J = 5.3 Hz), 8.04 (m, 2 H) , 
7.65 (s, 1 H) , 7.57 (s, 1 H) , 7.46 (d, 1 H, J = 0.6 Hz) 
7.22 (m, 3 H), 7.16 (s, 2 H) , 7.09 (d, 1 H, J = 0.8 Hz), 
7.06 (d, 2 H, J = 1.1 Hz), 7.00 (d, 1 H, J = 1.7), 6.88 (d, 

1 H, J = 1.8), 6.87 (d, 1 H, J = 1.8 Hz), 4.02 (s, 3 H) , 
4.00 (s, 3 H), 3.99 (s, 3 H) , 3.84 (s, 3 H) , 3.83 (s, 3 H) , 
3.83 (s, 3 H), 3.80 (s, 3 H) , 3.79 (s, 3 H) , 3.37 (q, 2 H, 
J = 6.2 Hz), 3.21 (q, 2 H, J = 6.4 Hz), 3.10 (q, 2 H, J = 
6.2 Hz), 3.00 (m, 2 H) , 2.73 (d, 6 H, J = 4.9 Hz), 2.34 (t, 

2 H, J = 7.2 Hz) , 2.28 (t, 2 H, J = 7.0 Hz) , 1.76 (m, 4 H) ; 
MALDI -TOF-MS , 1225.9 (1224.3 calc. for M+H) . 
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48 . ImlmlmPy - y - Py Py Py Py - [} - Dp - NH 2 

9 i 




A sample of polyamide machine -synthesized on resin 
(350 mg, 0.16 mmol/gram) was placed in a 20 mL glass 

5 scintillation vial, and treated with 2 mL 3 , 3 " -diamine -N- 
methyldipropylamine at 55 °C for 18 hours. The resin was 
removed by filtration through a disposable propylene 
filter, and the resulting solution dissolved with water to 
a total volume of 8 mL, and purified directly by 

10 preparatory reversed phase HPLC to provide ImlmlmPy-y- 
PyPyPyPy-P~Dp-NH 2 (31 mg, 40% recovery) as a white powder. 
X H NMR (DMSO-d 6 ) 5 10.37 (s, 1 H) , 10.16 (s, 1 H) , 9.95 (s, 
1 H) , 9.93 (s, 1 H) , 9.91 (s, 1 H) , 9.86 (s, 1 H) , 9.49 (br 
s, 1 H) , 9.47 (s, 1 H) , 8.12 (m, 3 H) , 8.0 (br s , 3 H) , 

15 7.65 (s, 1 H) , 7.57 (s, 1 H) , 7.46 (s, 1 H) , 7.20 (m, 3 H) , 
7.16 (m, 2 H) , 7.09 (d, 1 H, J = 1.5 Hz), 7.05 (m, 2 H) , 
7.00 (d, 1 H, J = 1.6 Hz), 6.88 (m, 2 H) , 4.01 (s, 3 H) , 
3.99 (s, 3 H) , 3.98 (s, 3 H) , 3.83 (s, 3 H) , 3.82 (s, 3 H) , 
3.81 (s, 3 H) , 3.79 (s, 3 H) , 3.78 (s, 3 H) , 3.36 (q, 2 H, 

20 J = 5.3 Hz), 3.21- 3.05 (m, 8 H) , 2.85 (q, 2 H, J = 4.9 
Hz), 2.71 (d, 3 H, J = 4.4 Hz), 2.34 (t f 2 H, J = 5.9 
Hz), 2.26 (t, 2 H, J = 5.9 Hz), 1.85 (quintet, J = 5.7 
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Hz), 1.72 (m, 4 H) . MALDI-TOF-MS, 1267.1 (1267.4 calc. for 
M+H) . 



4 9 . ImlmlmPy -y - PyPyPyPy - p - Dp - EDTA . 



5 




Compound was prepared as described for Imlmlm-y-PyPyPy- 
P-Dp-EDTA. (3.8 mg, 40%). X H NMR (DMSO-d s ) 8 10.34 (s, 1 H) , 
10.11 (s, 1 H), 9.92 (s, 1 H), 9.90 (s, 1 H) , 9.89 (s, 1 
H), 9.84 (s, 1 H). 9.57 (s, 1 H) , 8.42 (m, 1 H) , 8.03 (m , 

10 3 H), 7.64 (s, 1 H), 7.56 (s, 1 H) , 7.44 (s, 1 H) , 7.20 (m, 
3 H) , 7.15 (m, 2 H) , 7.07 (d, 1 H, J = 1.6 Hz), 7.05 (m, 2 
H), 6.99 (d, 1 H, J = 1.6 Hz), 6.87 (m, 2 H) , 4.00 (s, 3 
H) , 3.98 (s, 3 H)» 3.97 (s, 3 H) , 3.83 (m, 4 H) , 3.82 (s, 6 
H) , 3.79 (s, 3 H) , 3.78 (s, 6 H) , 3.67 (m, 4 H) , 3.4- 3.0 

15 (m, 16 H), 2.71 (d. 3 H, J = 4.2 Hz), 2.34 (t, 2 H, J = 
5.4 Hz), 2.25 (t, 2 H, J = 5.9 Hz), 1.72 (m, 6 H) . MALDI- 
TOF-MS, 1542.2 (1542.6 calc. for M+H) . The polyamide was 
loaded with Fe(II) by standard methods. 

20 
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50 . ImlmPyPy-y-ImlmPyPy-p-Dp: 

9 I 




The polyamide ImImPyPy-y-ImImPyPy-($-PAM-Resin was 
assembled on 0.2 mmol/gram Boc-p-PAM-resin by machine 
5 assisted synthesis. The y-Im step was assembled using Boc-y- 
Im acid (HBTU, DIEA) , all other residues was added as 
appropriate activated Boc protected monomer units. A sample 
of resin (250 mg, 0.16 mmol/gram 21 ) was placed in a 20 mL 
glass scintillation vial, 2 mL dimethylaminopropylamine 

10 added and the mixture allowed to stand at 55 °C for 18 
hours. Resin was removed by filtration through a disposable 
propylene filter, and the resulting solution diluted with 
water to a total volume of 8 mL, and purified directly by 
preparatory reversed phase HPLC to provide ImlmPyPy-y- 

15 ImlmPyPy-p-Dp (26 mg, 45% recovery) as a white powder. UV 
^nax(H 2 0) 248, 312 (66,000); 1 H NMR (DMSO-d*) d 10.34 (m, 2 
H) ; 10.32 (m, 2 H) ; 9.73 (m, 2 H) ; 9.5 (br s, 1 H) , 9.32 
(s, 1 H) ; 8.10 (m, 3 H); 7.55 (m, 2 H) ; 7.52 (s, 1 H) ; 7.44 
(s, 1 H) ; 7.23 (m, 2 H) , 7.14 (m, 4 H) ; 7.06 (d, 1 H, J = 

20 1.4 Hz); 6.86 (m, 2 H) ; 3.98 (m, 9 H) ; 3.95 (s, 3 H) ; 3.81 
(m, 6 H) ; 3.77 (m, 6 H) ; 3.31 (m, 2 H) ; 3.17 (t, 2 H, J = 
5.5 Hz) 3.06 (m, 2 H, J = 5.7 Hz); 2.93 (m, 2 H, J - 4.7 
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Hz); 2.74 (d, 6 H, J = 4.4 Hz); 2.30 (m, 4 H) ; 1.74 (m, 4 
H) ; MALDI-TOF-MS, 1224.9 (1225.3 calc. for M+H) . 



51 . ImPylmPy - y - ImPylmPy - p - Dp 



10 



15 



20 




The polyamide ImPylmPy-y-ImPylmPy-P-PAM-Resin was 
assembled on 0.2 mmol/gram Boc-P-PAM-resin by manual 
polyamide synthesis. The Py-Im and y-Im steps were 
addedusing Boc-y-Im acid and Boc-Py-Im acid (HBTU, DIEA) , 
all other residues were added as appropriate activated Boc 
protected monomer units. A sample of resin (250 mg, 0.16 
mmol/gram 21 ) was placed in a 20 mL glass scintillation vial, 
2 mL dimethylaminopropylamine added and the mixture allowed 
to stand at 55 °C for 18 hours. Resin was removed by 
filtration through a disposable propylene filter, and the 
resulting solution diluted with water to a total volume of 
8 mL, and purified directly by preparatory reversed phase 
HPLC to provide ImPylmPy-y-ImPylmPy-p-Dp (19 mg, 32% 
recovery) as a white powder. UV X Tnax (H 2 0) 246, 312 (66,000); 
X H NMR (DMSO-d 5 ) d 10.33 (m, 2 H) ; 10.25 (m, 2 H) ; 10.04 (m, 
2 H) ; 9.95 (s, 1 H) ; 9.5 (br s, 1 H) , 8.10 (m, 3 H) ; 7.57 
(m, 2 H); 7.48 (s, 1 H) ; 7.42 (s, 1 H) ; 7.40 (s, 1 H) ; 7.23 
(m, 2 H), 7.17 (d, 1 H; J = 1.5 Hz); 7.03 (d, 1 H, J = 1.5 
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Hz); 6.98 (m, 3 H) ; 4.02 (s, 3 H) ; 3.99 (m f 6 H) ; 3.81 (m f 
6 H) ; 3.97 (s, 3 H) ; 3.88 (m, 6 H) ; 3.83 <m, 6 H) ; 3.42 (m, 
2 H) ; 3.18 (t, 2 H, J = 5.2 Hz) 3.06 (m, 2 H, J 5.5 Hz); 
2.80 (m, 2 H, J = 4.7 Hz); 2.76 (d, 6 H, J = 4.4 Hz); 2.38 
5 (m, 4 H) ; 1.93 (m, 4 H) ; MALDI -TOF-MS , 1225.2. (1225.3 
calc . for M+H) . 



52 • Imlmlmlm-y- PyPyPyPy- P - Dp 

9 i 




10 The polyamide Imlmlmlm-y-PyPyPyPy-p-PAM-Resin was 

assembled on 0.2 mmol/gram Boc-b-PAM- resin by manual 
polyamide synthesis. The y-Im step was added using Boc-y-Im 
acid ( HBTU , DIEA) , all other residues were added as 
appropriate activated Boc protected monomer units. A sample 

15 of resin (250 mg, 0.16 mmol/gram 21 ) was placed in a 20 mL 
glass scintillation vial, 2 mL dimethylaminopropylamine 
added and the mixture allowed to stand at 55 °C for 18 
hours. Resin was removed by filtration through a disposable 
propylene filter, and the resulting solution diluted with 

20 water to a total volume of 8 mL, and purified directly by 
preparatory reversed phase HPLC to provide Imlmlmlm-y- 
PyPyPyPy-P-Dp (12 mg, 21% recovery) as a white powder. UV 
X max (H 2 0) 246, 314 (66,000); X H NMR (DMS0-d 6 ) d 9.91 (m, 2 
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H) 


; 9 


.89 (m, 4 
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; 2.32 



(m, 4 H); 1.70 (m, 4 H) ; MALDI -TOF-MS , 1225.6 {1225.3 calc. 
for M+H) . 

10 



5 3 . ImlmlmPy -[J- Py Py Py Py - p - Dp 




A sample of ImlmlmPy-p-PyPyPyPy-p-resin prepared by 
machine-assisted solid phase synthesis (240 mg, 0.16 

15 mmol/gram) was placed in a 20 mL glass scintillation vial, 
and treated with dimethylaminopropylamine (2 mL) at 55 °C 
for 18 hours. Resin was removed by filtration, and the 
filtrate diluted to a total volume of 8 mL with 0.1 % 
(wt/v) aqueous TFA. The resulting crude polyamide/amine 

20 solution was purified directly by reversed phase HPLC to 
provide the trif luoroacetate salt of ImlmlmPy-p-PyPyPyPy-p- 
Dp (31 mg, 40% recovery) as a white powder. X H NMR (300 MHz, 
[D 6 ]DMSO, 20 °C): d = 10.37 (s, IH; NH) , 10.12 (s, IH; NTH) , 
9,95 (s, IH; NH), 9.94 (s, IH; NH) , 9.93 (s, IH; NH) , 9.92 

25 (s, IH; NH), 9.59 (s, IH; NH) , 9.4 (br s, IH; CF 3 COOH) , 8.09 
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(m, 3H; NH), 7.65 (s, 1H; CH) , 7.56 (s, 1H; CH) , 7.45 (s, 
1H; CH) , 7.27 (d, 2 J(H,H) = 1.3 Hz, 1H; CH) , 7.22 (m, 2H; 
CH) , 7.18 (d, 2 J(H,H) = 1.2 Hz, 1H; CH) , 7.16" (d, 2 J(H,H) = 
1.0 Hz, 1H; CH), 7.07 (m, 2H; CH) , 6.95 (d, 2 J(H,H) = 1.1 
5 Hz, 1H; CH) , 6.88 <d, 2 J(H,H) = 1.4 Hz, 1H; CH) , 6.86 (d, 
2 i7(H,H) = 1.3 Hz, 1H; CH) , 4.01 (s, 3H; NCH 3 ) , 3.98 (m, 2H; 
NCH 3 ) , 3.83 (s, 3H; NCH 3 ) , 3.82 (m, 6H; NCH 3 ) , 3.80 (s, 3H; 
NCH 3 ) , 3.78 (s, 3H; NCH 3 ) , 3.4 (m, 6H; CH 2 ) , 3.11 (q, V(H,H) 
= 5.2 Hz, 2H; CH 2 ) , 2.94 (q, 4 J(H,H) = 5.3 Hz, 2H; CH 2 ) , 
10 2.69 (d, 2 J(H,H) - 4.4 Hz, 6H; N(CH 3 ) 2 ), 2.33 (t, 3 J(H,H) = 
5.4 Hz, 2H; CH 2 ) , 1.75 (q, V(H,H) = 7.1 Hz, 2H; CH 2 ) ; 
UV/VIS (H 2 0) ^ (q) = 304 (66,600, calculated from e = 
8,333/ring ll4cJ ) , 241 nm; MALDI -TOF-MS [M + -H] 1210.4: calc. 
1210.3 . 

15 



54 . ImlmPyPy- p - PyPy Py Py - p - Dp 




A sample of ImlmPyPy-p-PyPyPyPy-p- resin prepared by 
machine-assisted solid phase synthesis (240 mg, 0.16 

20 mmol/gram 15 ) was placed in a 20 mL glass scintillation vial, 
and treated with dimethylaminopropylamine (2 mL) at 55 °C 
for 18 hours. Resin was removed by filtration, and the 
filtrate diluted to a total volume of 8 mL with 0.1 % 
(wt/v) aqueous TFA. The resulting crude polyamide /amine 

25 solution was purified directly by reversed phase HPLC to 
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provide the trif luoroacetate salt of ImImPyPy-p-PyPyPyPy-p_ 
Dp (31 mg, 40% recovery) as a white powder. X H NMR (300 MHz, 
[D 5 ]DMSO, 20 °C) : 8 = 10.38 (s, 1H; NH) , 9.95 (s, 1H; NH) , 
9,93 (s, 1H; NH) , 9.91 (s, 1H; NH) , 9.90 (m, 2H; NH) , 9.76 
5 (s, 1H; NH) , 9.4 (br s, 1H; CF 3 COOH) , 8.09 (m, 3H; NH) , 7.56 
(s, 1H; CH) , 7.46 (s, 1H; CH) , 7.27 (d, 2 J(H,H) = 1.8 Hz, 
1H; CH) , 7.21 (d, V(H,H) = 1.7 Hz, 1H; CH) , 7.20 (d, 
2 J(H,H) = 1.9 Hz, 1H; CH) , 7.19 (d, 2 J"(H,H) = 1.9 Hz, 1H; 
CH) , 7.16 (d, 2 J(H,H) = 1.9 Hz. 1H; CH) , 7.15 (d, 2 J(H,H) = 

10 1.6 Hz, 1H; CH) , 7.14 (d, 2 J(H,H) =1.9 Hz, 1H; CH) , 7.12 
(d, 2 J(H,H) = 1.6 Hz, 1H; CH) , 7.07 (s, 1H; CH) , 7.05 (d, 
2 J(H,H) = 1.5 Hz, 1H; CH) , 6.87 (d, 2 cT(H,H) = 1.9 Hz, 1H,- 
CH) , 6.86 (d, 2 J(H,H) = 1.6 Hz, 1H; CH) , 6.84 (d, 2 J(H,H) = 
1.6 Hz, 1H; CH) , 3.99 (m, 6H; NCH 3 ) , 3.82 (m, 12H; NCH 3 ) , 

15 3.80 (s, 3H; NCH 3 ) , 3.78 (s, 3H; NCH 3 ) , 3.4 (m, 6H; CH 2 ) , 
3.09 (q, 4 J(H,H) = 5.6 Hz, 2H; CH 2 ) , 2.97 (q, V(H,H) = 5.2 
Hz, 2H; CH 2 ) , 2.71 (d, 2 J(H,H) = 4.2 Hz, 6H; N(CH 3 ) 2 ), 2.32 
(t, 3 J(H,H) = 5.1 Hz, 2H; CH 2 ) , 1.71 (q, 5 J(H,H) = 7.4 Hz, 
2H; CH 2 ) ; UV/VIS (H 2 0) (e) = 306 (66,600, calculated 

20 from e = 8,333/ring), 243 nm; MALDI -TOF-MS [M + -H] 1209.1: 
calc. 1209.3 
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A sample of ImPyPyPy-p-PyPyPyPy-p-resin prepared by 
machine-assisted solid phase synthesis (240 mg, 0.16 
mmol/gram) was placed in a 20 mL glass scintillation vial, 
and treated with dimethylaminopropylamine (2 mL) at 55 °C 
5 for 18 hours. Resin was removed by filtration, and the 
filtrate diluted to a total volume of 8 mL with 0.1 % 
(wt/v) aqueous TFA . The resulting crude polyamide/amine 
solution was purified directly by reversed phase HPLC to 
provide the trif luoroacetate salt of ImPyPyPy-p-PyPyPyPy.p- 

10 Dp (31 mg, 40% recovery) as a white powder. X H NMR (300 MHz, 
[D 6 ]DMSO, 20 °C) : 5 = 10.49 (s, 1H; NH) , 9.97 (s, 1H; NH) , 
9.95 (s, 1H; NH) , 9.94 (s, 1H; NH) , 9.93 (m, 2H; NH) , 9.91 
(s, 1H; NH) , 9.4 (br s, 1H; CF 3 COOH) , 8.10 (m, 3H; NH) , 7.38 
(s, 1H; CH) , 7.28 (d, 2 J(H,H) = 1.6 Hz, 1H; CH) , 7.22 (m, 

15 3H; CH), 7.19 (m, 2H; CH) , 7.16 (m, 2H; CH) , 7.09 (m, 2H; 
CH), 7.04 (m, 2H; CH) , 6.87 (d, 2 J(H,H) = 1.6 Hz, 1H; CH) , 
6.86 (d, 2 i7(H,H) = 1.6 Hz, 1H; CH) , 6.84 (d, 2 <J(H,H) = 1.5 
Hz, 1H; CH), 3.97 (s, 3H; NCH 3 ) , 3.82 (m, 15H; NCH 3 ) , 3.80 
(s, 3H; NCH 3 ) , 3.78 (s, 3H; NCH 3 ) , 3.4 (m, 6H; CH 2 ) , 3.10 

20 (q, 4 J(H,H) = 5.4 Hz, 2H; CH 2 ) , 2.98 (q, 4 J(H,H) = 5.3 Hz, 
2H; CH 2 ), 2.72 (d, 2 J(H,H) = 4.7 Hz, 6H; N(CH 3 ) 2 ), 2.33 (t, 
3 J(H,H) = 7.0 Hz, 2H; CH 2 ) , 1.71 (q, 5 J(H,H) = 6.4 Hz, 2H; 
CH 2 ); UV/VIS (H 2 0) (e) = 312 (66,600, calculated from e 

= 8,333/ring), 244 nm; MALDI-TOF-MS [M + -H] 1208.2: calc. 

25 1208.3. 
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56. ImPy Py Py Py - y - ImPy Py Py Py - p - Dp 

9 I 




A sample of ImPyPyPyPy~y~ ImPyPyPyPy-p- resin prepared by 
machine-assisted solid phase synthesis (240 mg, 0.16 
5 mmol/gram) was placed in a 20 mL glass scintillation vial, 
and treated with dimethyl ami nopropyl amine (2 mL) at 55 °C 
for 18 hours. Resin was removed by filtration, and the 
filtrate diluted to a total volume of 8 mL with 0.1 % 
(wt/v) aqueous TFA. The resulting crude polyamide/amine 

10 solution was purified directly by reversed phase HPLC to 
provide the trif luoroacetate salt of ImPyPyPyPy-y- 
ImPyPyPyPy-p-Dp (13 mg, 18% recovery) as a white powder. UV 
(H 2 0) 241, 316 (e) 83300 (calculated based on e = 

8,333/ring); X H NMR (DMSO-d*) 8 10.52 (s, 1 H) , 10.29 (s, 1 

15 H) , 10.04 (s, 1 H) , 10.00 (s, 1 H) , 9.97 (m, 3 H) , 9.92 (m, 
2 H) , 9.22 (br s, 1 H) , 8.06 (m, 3 H) , 8.03 (m, 2 H) , 7.46 
(s, 1 H) , 7.41 (s, 1 H) , 7.29 (d, 1 H, J= 1.0 Hz), 7.23 (m, 
1 H) , 7.17 (m, 1 H), 7.07 (m, 1 H) , 6.90 (d, 1 H, J= 6.9 
Hz), 3.99 (s, 3 H) , 3.94 (s, 3 H) , 3.85 (m, 6 H) , 3.79 (s, 

20 3 H) , 3.38 (q, 2 H, J= 3.2 Hz), 3.20 (q, 2 H, J=2 . 7 Hz), 
3.11 (q, 2 H, .7=1.8 Hz), 3.00 (q, 2 H, J=2 . 1 Hz), 2.72 (d, 
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6- H, J=4.8 Hz) , 2.35 (m, 4 H) , 1.75 (m, 4H) ; MALDI -TOF-MS , 
1466.1 (1467.6 calc. for M+H) 



5 7 . ImlmPy Py Py - y - ImPy Py Py Py - P - Dp 



5 




The polyamide was prepared as described for 
ImPyPyPyPy-y-ImPyPyPyPy-p-Dp as a white powder (28 mg, 34% 
recovery). UV A^*, 310 (83,300); X H NMR (DMSO-d 6 ) d 10.38 
(s, 1 H) , 10.28 (s, 1 H) , 10.02 (s, 1 H) , 9.99 (s, 1 H) , 

10 9.96 (m, 2 H), 9.91 (s, 2 H) # 9.76 (s, 1 H) , 9.18 (br s, 1 
H) , 8.05 (m, 3 H) , 7.57 (s, 1 H) , 7.46 (s, 2 H) , 7.25 (dd, 
2 H, 17=5.6), 7.23 (m, 4 H) , 7.16 (m, 4 H) , 7.07 (m, 4 H) , 
6.88 (d, 1 H, J=5.1), 4.00 (s, 3 H) , 3.94 (s, 3 H) , 3.85 
(m, 6H) , 3.79 (s, 3 H) , 2.99 (q, 2 H, J = 5.1), 2.73 (d, 6 

15 H, J = 4.8 Hz), 2.34 (m, 4 H) , 1.75 (m, 4 H) ; MALDI -TOF-MS , 
1468.2 (1468.6 calc. for M+H) . 
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5 8 . ImPy Py Py Py - y - ImPy Py Py Py ~P-Dp- NH 2 




A sample of ImPyPyPyPy-y- ImPyPyPyPy-p-resin prepared by 
machine-assisted solid phase synthesis (240 mg, 0.16 
5 mmol/gram) was placed in a 20 mL glass scintillation vial, 
and treated with 3 , 3 -diamino-iV-methyldipropylamine (2 mL) 
at 55 °C for 18 hours. Resin was removed by filtration, and 
the filtrate diluted to a total volume of 8 mL with 0.1 % 
(wt/v) aqueous TFA. The resulting crude polyamide/amine 
10 solution was purified directly by reversed phase HPLC to 
provide the trif luoroacetate salt of ImPyPyPyPy-y- 
ImPyPyPyPy-p-NH 2 (31 mg, 40% recovery) as a white powder. UV 
X^ax 241, 316 (e) 83300 (calculated based on e 
8,333/ring 5 ) ; X H NMR (DMSO-d*) 5 10.53 (s, 1 H) , 10.28 (s, 1 
15 H), 10.03 (s, 1 H), 10.00 (s, 1 H) , 9.96 (m, 2 H) , 9.92 (m, 
2 H) , 9.6 (br s, 1 H) , 8.07 (m, 4 H) , 7.89 (s, 3 H) , 7.45 
(s, 1 H) , 7.41 (s, 1 H), 7.27 (d, 2 H, J= 8.5 Hz), 7.23 (m, 
4 H) , 7.16 (m, 4 H) , 7.06 (m, 4 H) , 6.87 (m, 2 H) , 3.98, 
(s, 3 H), 3.94 (s, 3 H), 3.84, (m, 6 H) , 3.79 (s, 3 H) , 
20 3.35 (q, 2 H, J= 5.7 Hz), 3.16 (m, 8 H) , 2.85 (q, 2 H, J= 
5.6 Hz), 2.72 (d, 2 H, J= 4.2 Hz), 2.34 (m, 2 H) , 1.91 (m, 
4 H) , 1.78 (m, 4 H) . MALDI -TOF MS, 1510.4 (1510.7 calc. 
for M+H) . 
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5 9 . ImlmPyPyPy - y - ImPyPyPyPy - (J - Dp - NH 2 




NH 2 



The polyamide was prepared as a white powder as 
5 described for ImPyPyPyPy-y-ImPyPyPyPy-p-NH 2 . X K NMR (DMSO- 
d 6 ) 5 10.39 ( S/ 1 H) , 10.28 ( S/ 1 H) , 10.03 (s, 1 H) , 10.00 
(s, 1 H), 9.92 (m, 2 H) , 9.82 (s, 1 H) , 9.66 (br S/ 1 H) , 
8.11 (m, 4 H) , 7.89 (s, 3 H) , 7.57 (s, 1 H) , 7.46 (d, 2 H, 
J=2.4 Hz), 7.27 (dd, 2 H, J= 1.0 Hz) 7.23 (m, 4 H) , 7.16 
10 (m, 4 H) , 7.08 (m, 4 H) , 6.88 (m, 1 H) , 4.00 (s, 3 H) , 3.94 
(s, 3 H), 3.78 (s, 3 H) , 3.19 (q 7 2 H, J=5 . 1 Hz), 3.05 (m, 
8 H) , 2.86 (q, 2 H, <J= 4.8 Hz), 2.72 (d, 2 H, J= 4.4 Hz), 
2.34 (m, 4 H) , 1.90 (m, 4 H) , 1.78 (m, 4 H) . MALDI -TOF-MS , 
1510.4 (1511.7 calc. f or M+H) . 



6 0 . ImPy Py Py Py - y - ImPy Py Py Py ~[}-Dp- EDTA 

EDTA-di anhydride (50 mg) was dissolved by heating at 
55 °C for 5 min. in a solution of DMSO/NMP (1 ml) and DIEA 
(1 mL) . The dianhydride solution was added to ImPyPyPyPy-y- 
20 ImPyPyPyPy-p-Dp-NH 2 (8.1 mg) dissolved in DMSO (750 /iL) . 
The mixture was heated at 55 °C for 25 minutes, and treated 
with 0.1M NaOH (3 mL) , and heated at 55 °C for 10 minutes. 
Aqueous 0.1% (wt/v) TFA was added to adjust the total 
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volume to 8 mL and the solution purified directly by 
preparatory HPLC chromatography to provide ImPyPyPyPy-y- 
ImPyPyPyPy-p-Dp-EDTA as a white powder. (2.4 mg, 22% 
recovery) MALDI-TOF-MS, 1766.4 (1766.7 calc. for M+H) . 

5 

E. Plasmids, Footprinting, Affinity Cleavage 

1. Construction of plasmid DNA 

The experimental target plasmid pSES9hp was 
constructed by hybridization of the inserts: 
10 5 ' - GATCCTATGTCAGTCATGGGGATGACTGTCAGTCATGGCCATGACTGTCAGTCAT 
GCGCATGACTGTCAGTCTTAAGC - 3 ' and 

5 ' - GATACAGTCAGTACCCCTACTGACAGTCAGTACCGGTACTGACAGTCAGTACGCG 
TACTGACAGTCAGAATTCGTCGA-3 " . 

The hybridized insert was ligated into linearized pUC19 
15 BanHl / Hindlll plasmid using T4 DNA ligase. The resultant 
constructs were used to transform ToplOF' OneShot competent 
cells from Invitrogen. Ampicillin-resistant white colonies 
were selected from 2 5 mL Luria-Bertani medium agar plates 
containing 50 iig/mL ampicillin and treated with XGAL and 
20 IPTG - solutions. Large-scale plasmid purification was 
performed with Qiagen Maxi purification kits. Dideoxy 
sequencing was used to verify the presence of the desired 
insert. Concentration of the prepared plasmid was 

determined at 260 nm using the relationship of 1 OD unit = 
25 50 jig/mL duplex DNA. 

2. Preparation of 3'- and 5 '-End-Labeled Restriction 
Fragments 

The plasmid pSES9hp was linearized with EcoRI and 
30 PvuII and then treated with Klenow fragment, deoxy adenosine 
5'- [a- 32 P] triphosphate and thymidine 5'- [a- 32 P] triphosphate 
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for 3" labeling. Alternatively, pSES9hp was linearized 
with EcoRI, treated with calf alkaline phosphatase, and 
then 5' labeled with T4 polynucleotide * kinase and 
deoxyadenosine 5 [y- 32 P] triphosphate . The 5" labeled 

5 fragment was then digested with PvuII. The labeled 
fragment (3' or 5') was loaded onto a 5% non-denaturing 
polyacryl amide gel, and the desired 282 base pair band was 
visualized by autoradiography and isolated. Chemical 
sequencing reactions were performed according to published 
10 methods. (Maxam, A.M. U Gilbert, W.S. (1980). Sequencing 
End-Labeled DNA with Base-Specific Chemical Cleavages. 
Methods Enzymol . 65, 499-560; Iverson, B.L. & Dervan, P.B. 
(1987) . Adenine-specif ic DNA chemical sequencing reaction. 
Methods Enzymol. 15, 7823-7830.) 

15 

3. MPE»Fe(II) Footprinting 

All reactions were carried out in a volume of 40 /xL. 
A polyamide stock solution or water (for reference lanes) 
was added to an assay buffer where the final concentrations 
were: 25 mM Tris-acetate buffer (pH 7.0), 10 mM NaCl, 100 
/zM/base pair calf thymus DNA, and 30 kcpm 3"- or 5'- 
radiolabeled DNA. The solutions were allowed to 

equilibrate for 4 hours. A fresh 50 /iM MPE*Fe(II) solution 
was made from 100 /zL of a 100 /iM MPE solution and 100 /iL of 
a 100 /iM ferrous ammonium sulfate (Fe (NH 4 ) 2 (S0 4 ) 2 # 6H 2 0) 
solution. MPE*Fe(II) solution (5 /iM) was added to the 
equilibrated DNA, and the reactions were allowed to 
equilibrate for 5 minutes. Cleavage was initiated by the 
addition of dithiothreitol (5 mM) and allowed to proceed 
30 for 14 min. Reactions were stopped by ethanol 

precipitation, resuspended in 100 mM tris-borate-EDTA/80% 



20 
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formamide loading buffer, denatured at 85°C for 5 min, 
placed on ice, and half of each tube (~ 15 kcpm) was 
immediately loaded onto an 8% denaturing polyacrylamide gel 
(5% crosslink, 7 M urea) at 2000 V. 

5 

4. Affinity Cleaving. All reactions were carried 
out in a volume of 4 0 /xL. A polyamide stock solution or 
water (for reference lanes) was added to an assay buffer 
where the final concentrations were: 25 mM Tris-acetate 

10 buffer (pH 7.0), 10 mM NaCl , 100 /iM/base pair calf thymus 
DNA, and 2 0 kcpm 3'- or 5 ' -radiolabeled DNA. The solutions 
were allowed to equilibrate for 4 hours. A fresh solution 
of ferrous ammonium sulfate (Fe (NH 4 ) 2 (S0 4 ) 2 # 6H 2 0) (10 fiM) was 
added to the equilibrated DNA, and the reactions were 

15 allowed to equilibrate for 15 minutes. Cleavage was 
initiated by the addition of dithiothreitol (10 mM) and 
allowed to proceed for 30 min. Reactions were stopped by 
ethanol precipitation, resuspended in 100 mM tris-borate- 
EDTA/80% formamide loading buffer, denatured at 85°C for 5 

20 min, placed on ice, and the entire sample was immediately 
loaded onto an 8% denaturing polyacrylamide gel (5% 
crosslink, 7 M urea) at 2000 V. 

5 . Identification of Binding Orientation by Affinity 
25 Cleaving . 

Affinity cleavage assays (25 mM Tris-acetate, 10 mM 
NaCl, 100 /iM/base pair calf thymus DNA, pH 7.0 and 22 °C) 
were performed in order to identify the binding 
orientations of the EDTA analogues of the three hairpin 
30 polyamides : ImlmPyPy-y-ImlmPyPy-p-Dp-EDTA, ImPylmPy-y- 
ImPylmPy-p-Dp-EDTA , and Imlmlmlm-y-PyPyPyPy-p-Dp-EDTA 
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The polyamides ImlmPyPy-y-ImlmPyPy-P-Dp-EDTA, ImPylmPy-y- 
ImPylmPy-p-Dp-EDTA recognize their respective palindromic 
match sequences, 5 ' -TGGCCA-3 " and 5 " -TGCGCA-3 ' , in two 
equivalent orientations, consistent with hairpin formation. 
5 In contrast, the polyamide Imlmlmlm-y-PyPyPyPy-p-Dp-EDTA 
recognizes a non-palindromic sequence, 5 " -TGGGGA-3 ' , in a 
single orientation with cleavage visible only on the 5"- 
side of the site, as predicted by the hairpin model. 

Depicted in Figure 17 is a representative affinity 

10 cleaving experiment on a 3 ' - 32 P-labeled 282 bp EcoRl/PvuII 
restriction fragment from plasmid pSES9hp. The 5'-TGGCCA- 
3", 5"-TGCGCA-3 and 5 ' -TGGGGA- 3 " sites are shown on the 
right side of the autoradiogram. Lane 1, A reaction; lane 
2, C reaction; lanes 3-5, 1 /xM, 2 fiM and 5 jiM ImlmPyPy-y- 

15 ImlmPyPy-p-Dp-EDTA (1-E) ; lanes 6-8, 1 //M, 2 fM and 5 /xM 
ImPylmPy-y-ImPylmPy-P-Dp-EDTA (2-E) ; lanes 9-11, 1 M M, 2 /iM 
and 5 /zM Imlmlmlm-y-PyPyPyPy-p-Dp-EDTA (3-E) ; lane 12, 
intact DNA. All lanes contain 15 kcpm 3 " -radiolabeled DNA, 
25 mM Tris-acetate buffer (pH 7.0)', 10 mM NaCl , and 100 

20 /zM/base pair calf thymus DNA. (Right) Affinity cleavage 
patterns for ImlmPyPy-y-ImlmPyPy-p-Dp-EDTA and ImPylmPy-y- 
ImPylmPy-p-Dp-EDTA at 1 fiM concentration, and Imlmlmlm-y- 
PyPyPyPy-p-Dp-EDTA at 5 /iM concentration. Illustration of 
the 282 bp restriction fragment with the position of the 

25 sequence indicated. Bar heights are proportional to the 
relative protection from cleavage at each band. Boxes 
represent equilibrium binding sites determined by the 
published model, and only sites that were quantitated by 
DNase I footprint titrations are boxed. 

30 
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6. DNase I Pootprinting 

All reactions were carried out in a volume of 400 /zL. 
We note explicitly that no carrier DNA was used in these 
reactions. A polyamide stock solution or water (for 
reference lanes) was added to an assay buffer where the 
final concentrations were: 10 mM Tris*HCl buffer (pH 7.0), 
10 mM KC1, 10 mM MgCl 2 , 5 mM CaCl 2 , and 20 kcpm 3'- 
radiolabeled DNA. The solutions were allowed to 

equilibrate for a minimum of 12 hours at 22 °C. Cleavage 
was initiated by the addition of 10 \iL of a DNase I stock 
solution (diluted with 1 mM DTT to give a stock 
concentration of 0.28 u/mL) and was allowed to proceed for 
5 min at 22 °C. The reactions were stopped by adding 50 mL 
of a solution containing 2.25 M NaCl, 150 mM EDTA, 0.6 
mg/mL glycogen, and 30 mM base-pair calf thymus DNA, and 
then ethanol precipitated. The cleavage products were 
resuspended in 100 mM tris-borate-EDTA/80% formamide 
loading buffer, denatured at 85°C for 5 min, placed on ice, 
and immediately loaded onto an 8% denaturing polyacryl amide 
gel (5% crosslink, 7 M urea) at 2000 V for 1 hour. The 
gels were dried under vacuum at 80°C, then quantitated 
using storage phosphor technology. 

The data were analyzed by performing volume 
integrations of the 5 ' -TGGCCA-3 " , 5 ' -TGCGCA-3 ' , and 5'- 
25 TGGGGA-3" sites and a reference site. The apparent DNA 
target site saturation, 8 a p p/ was calculated for each 
concentration of polyamide using the following equation: 



15 



20 



G = 1 - 
app | 
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where I to t and I re f are the integrated volumes of the 
target and reference sites, respectively, and Itot° and 
Iref° correspond to those values for a DNase I control 
lane to which no polyamide has been added. The ([L]tot* 
5 ©app) data points were fit to a Langmuir binding isotherm 
(eq 2, n=lor n=2) by minimizing the difference between O a pp 
and 9fit/ using the modified Hill equation: 



9 fit e min + ^max^min) * ^ 

1+K a n W n tot 

10 (2) 



where [L] tot corresponds to the total polyamide 
concentration, K a corresponds to the equilibrium 
association constant, and 6min and 9 ma x represent the 

15 experimentally determined site saturation values when the 
site is unoccupied or saturated, respectively. Data were 
fit using a nonlinear least -squares fitting procedure of 
KaleidaGraph software (version 2.1, Abelbeck software) with 
k 6li ©max, and 0 m i n as the adjustable parameters. All 

20 acceptable fits had a correlation coefficient of R > 0.97. 
At least three sets of acceptable data were used in 
determining each association constant. All lanes from each 
gel were used unless visual inspection revealed a data 
point to be obviously flawed relative to neighboring 

25 points. The data were normalized using the following 
equation: 
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fi app min 

norm ~ " 

max u min ^ 

7, Quantitation by Storage Phosphor Technology 
Autoradiography 

5 Photostimulable storage phosphorimaging plates (Kodak 

Storage Phosphor Screen S0230 obtained from Molecular 
Dynamics) were pressed flat against gel samples and exposed 
in the dark at 22°C for 12-20 h. A Molecular Dynamics 400S 
Phosphorlmager was used to obtain all data from the storage 

10 screens. The data were analyzed by performing volume 
integrations of all bands using the ImageQuant v. 3.2. 



EXAMPLE 2 

Synthesis And Oxidative Cleavage Of Double -Helical Dna By 
15 Poly-amides Modified With A Polyamide -Ni ( Ii ) Tripeptide 

Conjugate 

Many anticancer drugs act through their ability to 
modify DNA. Novel polyamide conjugates have been designed 
which modify double-helical DNA in a sequence specific 

20 manner. More specifically the metalopeptide Ni (II) •Gly-Gly- 
His has been covalently attached to a pyrrole- imidazole 
polyamide. The conjugate was synthesized using manual solid 
phase synthesis protocols developed by the Dervan group 
using Boc- pyrrol e-OBt ester and Boc- imidazole acid 

25 monomers, activated esters of y-aminobutyric acid and 0- 
alanine, and Boc-p-alanine-Pam resin. Individual polyamides 
are purified by reversed phase HPLC and characterized by 
MALDI-TOF mass spectrometry. 

The metallopeptide Ni (II) •Gly-Gly-His has been shown 

30 to promote the efficient oxidative cleavage of DNA in the 



135 



WO 98/49142 



PCT/US98/06997 



10 



15 



20 



presence of monoperoxyphthal ic acid. (Mack and Dervan, J. 
Am. Chem. Soc . , 112, 4604 (1990); Mack and Dervan, 
Biochemistry, 31, 9399 (1992)). 

The reaction is thought to proceed through a mechanism 
that involves abstraction of hydrogen atom(s) from the 
deoxyribose backbone of DNA by a nondiff usable high valent 
nickel bound oxygen. Bifunctional conjugates were designed 
in order to combine the ability of polyamides to recognize 
any predetermined DNA sequence with the Ni (II) •Gly-Gly-His 
chemistry. The symmetric anhydride of the amino acid His 
and the activated ester of Gly were coupled to the extended 
hairpin polyamide directly on the p-alanine-Pam resin 
employing solid phase chemistry protocols. Denaturing 
polyacrylamide gel electrophoresis of 32P end-labeled DNA 
treated with the Ni (II) .Gly-Gly-His modified polyamide at 
pH 7.5 demonstrated the ability of the conjugate to cleave 
the double helical DNA in a sequence selective manner in 77 
% and 72 % yields on the 3 ' -end-labeled DNA (at 10 nM 
polyamide). The chemical structure of the Ni (II) •Gly-Gly- 
His modified polyamide is shown in Figure 18. 



EXAMPLE 3 

Sequence Specific Alkylation Of Dna By Pyrrole -Imidazole 
Polyamides Modified With Dna Reactive Agents 

The design of sequence specific DNA binding-modifying 
molecules requires the integration of two separate 
entities: recognition and functional reactivity. The 
present inventor has discovered ligands which combine 
pyrrole- imidazole polyamide DNA binding motifs with 
mechanism based reactive functionalities capable of 
electrophilic modification of bases in the minor groove. 
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The design of sequence specific molecules for 
alkylation of double helical DNA requires both a specific 
DNA binding molecule and an atom specific DNA cleaving 
moiety. Hairpin polyamides are sequence specific molecules 
5 that can bind to any predetermined DNA sequence. 
Bromoacetyl and the prodrug analogue of the cyclopropyl 
elect rophile of CC-1065 react in an atom specific manner 
with double helical DNA. By tethering a bromoacetyl moiety 
or the prodrug analogue of the cyclopropyl electrophile of 
CC-1065 to a hairpin polyamide the present inventor has 
discovered a sequence specific DNA alkylating agent which 
can be targeted to any predetermined DNA sequence at 
subnanomolar concentration. 

The two criteria for successful bifunctional molecule 
design are sequence specific reactions at designated single 
atoms within the bound complex, and cleavage yields that 
are quantitative under physiological conditions (i.e. 

neutral pH, 37° C, 100-200nM KCl/NaC-1) . In order to 
maximize stoichiometric reaction on the DNA, the 'cleaving 
functionality' must be sufficiently reactive with DNA at 

3 7° C, be inert in aqueous media, and not react with buffer 
components, and not suffer unimolecular decomposition in 
competition with desired reactions on DNA. In order to 
design such bifunctional molecules, hairpin polyamides 
equipped with either an N- terminal bromoacetyl group or a 
prodrug analogue of the cyclopropyl electrophile of CC-1065 
have been prepared. 



A. Bromoacetylated polyamides 

30 The polyamide NH 2 PyPyPyPy-g-ImPyPyPy-b-Dp was designed 

to target the sequence 5' -AGTTT*A-3 f . T* indicates the 
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thymine opposite the alkylated adenine. The polyamide was 
synthesized by solid phase protocols, cleaved from the 
solid support with dimethyl amino propylamine, and purified 
by reverse phase HPLC chromatography. The terminal pyrrole 
5 residue was deprotected and left unacetylated, leaving a 
free primary amine on the N- terminus. 

In order to bromoacetylate the polyamide, bromoacetic 
acid was activated with HOBt and DCC in 1 ml DMF. After 5 
minutes, the DCU was filtered off and the solution added to 
10 the polyamide with DIEA. After 15 minutes, .the reaction 
mixture was purified directly by reversed phase HPLC to 
isolate the bromoacetylated polyamide. Short reaction times 
were used to avoid alkylation of the unprotected imidazole 
ring nitrogen. The purified N-bromoacetyl hairpin 

polyamide was characterized by mass spectrometry. The 
synthesis of a bromoacetylated hairpin polyamide is 
described in Figure 19. 

Another set of polyamides was synthesized, based on an 
extended hairpin motif. This motif combines the y-turn of 
the hairpin motif with the p-alanine spacer of the extended 
motif, combining the 2:1 binding mode with the 1:1 binding 
mode. The following compounds were synthesized: PyPy-p- 
PyPyPy-y-ImPyPy-P-Dp, PyPyPy-P-PyPyPy-y-lmPyPy-p-Dp, PyPy-p- 
PyPyPy-y-PyPylm-P-Dp, and PyPyPyPyPyPy-y-PyPyim-p-Dp . The 
syntheses of the bromoacetylated extended hairpins were 
successful, and were prepared as described for the 
bromoacetylated hairpin polyamide. As controls, all four 
of the acetylated compounds were made as well. 
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B. 



Typical manual solid phase polyamide synthesis of 
Py Py PyPy - y - ImPy Py Py -p-Dp. 



Boc-p-alanine-Pam resin (1.25 g, 0.25 mmol) was placed 
in a 20 ml glass reaction vessel and shaken in DMF for 5 
5 minutes and drained. The resin was washed with DCM (2 
volumes) and deprotected with 80% TFA/DCM/0.5 M PhSH (1 
wash, 1 X 20 minutes) . Following deprotection, the resin 
was washed 3 time with DCM and 1 time with DMF. Boc- 
pyrrole-OBt ester (357 mg, 1 mmol) was added in 2 ml of DMF 

10 followed by 1 ml DIEA. The coupling reaction was shaken 
vigorously for 45 minutes. Resin samples (5 mg)were taken 
periodically to monitor the synthesis by HPLC. Successive 
cycles of the remaining monomers, Boc-Py-OBt (2X) , Boc-y-Im- 
COOH, Boc-Py-OBt (4X) . Boc-y- Im-COOH was activated by 

15 addition of HBTU (378 mg, 1 mmol) in 2 ml of DMF. DIEA (1 
ml) was added and the solution was allowed to stand for 5 
minutes until clear. After completion of the synthesis, 
the resin was washed with DMF, DCM, methanol, and ethyl 
ether. The resin was then lyophilized to remove solvent. 

20 The polyamide was cleaved off the resin with (N,N) - 
dimethylamino propylamine (2 ml) in a glass scintillation 

vial at 55°C for 12 hours. The polyamide was filtered and 

HPLC purified in 0.1% TFA with a 0.25% CH3CN min" 1 
gradient. 



C. Synthesis of bromoacetylated polyamides 

Bromoacetic acid (65 mg, 0.5 mmol) and 
hydroxybenzotriazole (65 mg, 0.5 mmol) were dissolved in 1 
ml DMF. DCC (102 mg, 0.5 mmol) was added. After 5 
30 minutes, the DCU was filtered off, and the solution added 
to the polyamide (10 mg, 0.1 mmol). The filter was washed 
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with 1 ml DMF, and 300 ul DIEA which was added to the 
reaction. The reaction was allowed to sit at room 
temperature for 15 minutes. HPLC purification in 0.1% 
(w/v) TFA with gradient elution of 0.25% CH3CN min" 1 
5 Bromoacetylated polyamide was recovered, (0.184 mg, 135.6 u 
mol) . UV A. max (s): 312 nm (66,600). MALDI-TOF MS: 1358.3 
(1357.29 calculated for M+l) . 



E. AcPyPyPyPy-y-lmPyPyPy-Dp 

Synthesized as above. UV X max (e) : 318 nm (66,600). 
MALDI-TOF MS: 1279.5 (1279.4 calculated for M+l). 



F. NH2 Py Py - P - Py Py Py - y - ImPy Py - p - Dp 

Synthesized as above. UV Xmax(e): 310 nm (66,600). 

15 X H NMR (DMSO-d 6 ) 5 10.25(s, 1H) , 9.98 (m,2H), 9.9 (m, 3H) , 
9.72 (m, 2H), 9.3 (1H, br s) , 8.04-8.02 (m, 4H) , 7.44 (s, 
1H), 7.23 (d, 1H), 7.20 (d, 1H) , 7.18 (s, 1H) , 7.17 (d, 
1H), 7.167 (s, 1H), 7.145 (s, 1H) , 7.119 (s, 1H) , 7.08 (s, 
1H), 7.025 (s, 1H), 6.9 (s, 1H) , 6.85 (s, 1H) , 6.80 (d, 

20 1H), 6.79 (s, 1H), 3.90 (s, 3H) , 3.85 (s, 3H) , 3.81 (s, 
3H), 3.80 (s, 3H), 3.77 (s, 3H) , 3.66 (br, 12H) , 3.43-3.34 
(m, 8H), 3.17 (m, 2H) , 3.08 (m, 2H) , 2.98 (m, 2H) , 2.32 (m, 
6H) , 1.74 (m, 4H) . 



G. Ac Py Py - P - Py Py Py - y - ImPy Py - P - Dp 

To a solution of NH2PyPy-P-PyPyPy-y-lmPyPy-p-Dp in 
DMSO/NMP (500 jil) and DIEA (500 ul) was added acetic 
anhydride (400 ul) . The reaction was heated at 55°C for 15 
minutes and HPLC purified as above. UV X max (E) : 310 nm 
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(66,600). MALDI-TOF MS: 1351.0 (1350.5 calculated for 
M+l) . 

H . B r Ac Py Py - p -PyPyPy-y- ImPy Py - p - Dp 

5 Synthesized as bromoacetylated polyamide above. UV X 

max(e) : 314 nm (66,600). MALDI-TOF MS: 1429.3 (1429.4 
calculated for M+l) . 

I. NH 2 Py Py Py - P - Py Py Py - y - ImPy Py - P - Dp 

10 Synthesized as above. UV ^ m ax(e> • 314 nm (74,925). 1 H 

NMR (DMSO~d 6 ) 5 10.25 (s, 1H) , 10.02 (s, 1H) , 9.99 (s, 1H) , 
9.90 (m, 3H) , 9.72 (m, 2H) , 8 . 03 - 8 . 06 (m, 4H) , 7.44 ( S/ 1H) , 
7.23-7.21 (m # 3H) 7.17-7.12 (m, 4H) , 7.09 (d, 1H) , 6.98-6.83 
(m, 6H) , 3.92 (s, 3H) , 3.87 (s, 3H) , 3.82 (s, 3H) , 3.79 (s, 

15 3H) , 3.77 (s, 3H) , 3.51 (m, 12 H) , 2.72 (m, 4H) , 2.48 (m, 
4H) , 2.32 (m, 6H) 1.78 (m, 4H) . 

J. Ac Py Py Py - P - Py Py Py - y - ImPy Py - P - Dp 

Acetylated as above. UV X max (e) : 314 nm* (74,925). 
20 MALDI-TOF MS: 1472.0 (1472.6 calculated for M+l) 

NH2PyPy-fi-PyPyPy-y-PyPyIrn-$-Dp. Synthesized as above. UV X 

max(e): 310 nm (66,600). 1 H NMR (DMSO-d6) 6 10.31 (s, 1H) , 
9.98 (s, 1H) , 9.91 (s, 1H), 9.89 (s, 1H) , 9.84 (s, 1H) , 
9.71 (br, 2H) , 8.06-8.08 (m, 3H) , 7.95 (s, 1H) , 7.48 (s, 
25 1H) , 7.28-7.15 (m, H) , 7.08 (s, 1H) , 7.02 (m, 2H) , 6.91- 
6.86 (m, 3H) , 6.80 (s, 1H) , 3.91 (s, 3H) , 3.86 (s, 3H) , 
3.82-3.78 (m, 12H) , 3 . 56-3 . 43 (m, 12H) , 3.22 (m, 2H) , 3.10 
(m, 2H) , 3.0 (m, 2H) , 2.4 (m, 2H) , 2.3 (m, 6H) , 1.7 (m, 4H) 
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K. Ac Py Py - P - Py Py Py - y - Py Py Im- p - Dp 

Acetylated as above. UV A. ma x(e): 310 nm (66,600). 
MALDI-TOF MS: 1350.0 (1350.5 calculated for M+l) 
NH2PyPyPyPyPyPy-y-PyPyIm-$-Dp. Synthesized as above. UV X 

5 max(e): 310 nm (74,925). 1 H NMR (DMSO-dg) S 10.31 (s, 1H) , 

10.04 (s, 1H) , 9.95 (d, 3H) , 9.90 (s, 1H) , 9.84 (d, 2H) , 

8.08-8.06 (m, 3H) , 7.48 (s, 1H) , 7.28 (s, 1H) , 7.23-7.21 

(m, 2H) , 7.15 (s, 1H) , 7.10-7.07 (m, 8H) , 6.94-6.87 (m, 
4H), 3.95-3.78 (m, ), 3.58 (m, 2H) , 3.43 (m, 2H) , 3.19 (m, 

10 2H) , 3.07 (m, 2H) , 2.36 (m, 2H) , 2.32 (m, 6H) , 2.25 (m, 
2H) , 1.74 (m, 4H) . 

L . AcPyPyPyPyPyPy-y-PyPylm-p -Dp 

Acetylated as above. UV A, max (e) : 310 nm (74,925). 
15 MALDI-TOF MS: 1401.0 (1401.5 calculated for M+l). 

BrAcPyPyPyPyPyPy-y-PyPylm-$-Dp . Bromoacetylated as above. 
UV ^max(e) : 310 nm (74,925). MALDI-TOF MS: 1480.7 (1480.4 
calculated for M+l) . 

20 M. Alkylation reactions 

Alkylation was examined on a 262 bp restriction 
fragment (EcoRl/FspI) of pBR322, radiolabeled on the 3' end 
(10,000 cpm/reaction) . Polyamide or bromodistamycin were 
added at appropriate concentrations. Final reaction 

25 concentrations were 10 mM sodium phosphate (pH 7.0), 100 jiM 
sonicated calf thymus DNA. The reactions were incubated at 

3 7°C for 0, 1, 5, 10, 20, and 40 hours. Following 
incubation, the reactions were ethanol precipitated and 
dissolved in 10 (il 10 mM sodium phosphate buffer and heated 



142 





WO 98/49142 



PCT/US98/06997 



at' 90 C for 15 minutes. Piperidine (40 1.4 M) was added 

and the reaction heated again for 15 minutes at 90°C. 
Piperidine was lyophilized off and the reactions were 
resuspended in 7 (al lx TBE/80% formamide loading buffer, 

5 denatured by heating at 85°C for 10 minutes and placed on 
ice. Reactions were electrophoresed on 8% polyacrylamide 
gels (5% cross link, 7 M urea) in lx TBE at 2000V. Gels 
were dried and exposed to a storage phosphor screen 
(Molecular Dynamics) . 



N. NH2PyPyPyPy-y-ImPyPyPy-NH(CH2) 20H 

Polyamide was synthesized as above on glycine linked 
Pam resin. For cleavage, resin (500 rag) was weighed out 
into a 50 ml flask in 5 ml 100% EtOH. An equal weight of 
15 LiBH4 (500 mg, 23 mmol) was slowly added. Resin was 

incubated at 55°C for 2 hours, adding more ethanol as 
needed. Polyamide was HPLC purified as above. UV X max (z) : 
314 nm (66,600). MALDI-TOF MS: 1124.0 (1124.2 

calculated). X H NMR (DMSO-d 6 ) 5 10.26 (s, 1H) , 10.02 (s, 
20 1H), 9.99 (s, 1H), 9.94 (d, 2H) , 9.89 (d, 2H) , 8.02 (m, 
1H), 7.91 (m, 1H), 7.43 (s, 1H) , 7.24-7.20 (m, 2H) , 7.16- 
7.12 (m, 2H) , 7.09-7.02 (m, 4H) , 6.92-6.84 (m, 4H) , 3.92- 
3.87 (m, 6H) , 3.83-3.77 (m, 18H) . 

25 EXAMPLE 4 

Polyamide CBI Unit 

(+)CC-1065 is a natural product isolated from 
Streptomyces zelensis. It binds in the minor groove and 
shows antitumor activity due to a reactive cyclopropyl 



10 



143 



WO 98/49142 



PCT/US98/06997 



moiety which alkylates preferentially at N3 of adenine 
(Boger and Johnson. Angew. Chem. Int. Ed. Eng. 1996, 3 5 , 
1438-1474) . 

Also in this class of antitumor antibiotics are the 
5 duocarmycins . They are structurally very similar to CC- 
1065, having the reactive cyclopropyl ring, but lacking the 
third conjugated ring system. These compounds bind in AT 
tracts, and display strong sequence selectivity for 
alkylation at adenines. Alkylation will occur at N3 of 

10 guanine as well, but only when other AT bp are protected in 
the minor groove. The flanking sequence preferences for 
alkylation by CC-1065 are 5 * -AAA- 3 1 >5 r -TTA-3 ' >5 ' -TAA- 3 1 >5 » - 
ATA- 3 1 . The alkylation reaction is reversible for the two 
duocarmycin compounds but irreversible for CC-1065. This 

15 discrepancy is explained by the more extensive non-covalent 
interactions of CC-1065 with the DNA minor groove. (+) CC- 
1065 is the natural enantiomer. The unnatural enantiomer 
has been synthesized by Boger and coworkers and shown to 
alkylate DNA as well. Interestingly, where the natural 

20 enantiomer binds 3 ! to 5 1 from the site of alkylation, the 
unnatural enantiomer binds 5' to 3'. Structures of ( + ) CC- 
1065 and the duocarmycins are shown in Figure 20. 

When compared to N-Bromoacetyldistamycin, CC-1065 
shows very different reactivity. For reaction times of 1 

25 hour at 37° C, N-Bromoacetyldistamycin shows almost no 
visible cleavage, while (+) CC-1065 shows intense cleavage 

at 13 adenines. After 10 hours at 37°, N- 

Bromoacetyldistamycin shows a comparable amount of cleavage 
to (+) CC-1065 at 1 hour, but at only one adenine. Despite 
30 the apparent similarities between these two molecules, 
being crescent -shaped with an electrophile that covalently 
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binds DNA, the cyclopropyl electrophile of CC-1065 
alkylation shows faster kinetics than that of N- 
Bromoacetyldistamycin. The alkylation mechanism for CC-1065 
is shown in Figure 21. 
5 Several pro-drug analogues of CC-1065 have also been 

made. One of the most popular is bizelesin, a bifunctional 
interstrand DNA crosslinker synthesized by Upjohn. It is 
believed to go through the same cyclopropyl intermediate as 
CC-1065, but is more stable than the cyclopropyl analogues. 

10 The structures of Bizelesin and CBI are shown in Figure 22. 

Boger et al have synthesized many modified versions of 
the A ring of ( + ) CC-1065 to examine the effects of steric 
changes on the alkylation potency of these drugs. In his 
work with CC-1065 derivatives, it has been shown that there 

15 is a direct linear correlation between drug stability and 
cytotoxicity. The more solvolytically stable compounds 
also show the highest degree of cytotoxicity. The most 
successful modification thus far, is the synthesis of 
1,2,9, 9a-tetrahydrocyclopropa [c] benz [e] indol-4-one (CBI) 

20 which replaces the fused pyrrole with a fused benzene ring, 
releasing ring strain in the system. (Boger, D. L . , Yun, 
W., and Han, N. Bioorganic and Medicinal Chemistry 1995, 3, 
1429-1453.) When coupled to the B and C rings of CC-1065, 
CBI showed greater stability, reactivity, and selectivity 

25 than ( + ) CC-1065 itself. Boger has also shown that a Boc 
protected CBI unit is sufficient for DNA alkylation. Its 
fast kinetics and efficient alkylation make CBI an ideal 
candidate to tether to a hairpin to generate a powerful 
sequence specific alkylator. 
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The CBI subunit was synthesized as described by Boger. 
(Boger, D. L. a. McKie., J. A. J. Org. Chem. 1995, 60, 1271- 



synthesized by the condensation of ammonia and 1,3 
dihydroxynaphthalene with immediate Boc protection by Boc 
anhydride. After protection of the alcohol with benzyl 
bromide, treatment with NIS provided the iodonaphthylamine . 
Alkylation with allyl bromide provided a substrate for a 
favorable 5-exo-trig aryl radical -alkene cyclization to 
occur, using Bu3SnH and TEMPO radical trap. Cleavage of 
the TEMPO trap intermediate occurred upon heating with 
activated Zn powder. Treatment with PPh3/CCl4, followed by 
hydrolysis of the benzyl ether gave the desired product. 

In order to work out the coupling conditions of the 
polyamide-CBI unit, a simple three ring compound, ImPyPy-p- 
NH(CH2>2NH2 was made. A new activation strategy used 
disuccinimidyl glutarate (DSG) , a diacid activated with NHS 

5 esters. A P-alanine linker was added to the CBI unit to 
facilitate completion of the reaction, according to the 
procedure by Lukhtanov, E.A. and coworkers. (Lukhtanov, et 
al. Nucleic Acids Research 1996, 24, 683-687.) After HPLC 
purification, one major peak was isolated. This fraction 

10 was analyzed by mass spectrometry and NMR and could be 
identified as the polyamide-CBI (chloro) conjugate. The 
synthesis of a CBI-polyamide conjugate is shown in Figure 
23. 

15 A. ImPyPy-fi-ED 

Polyamide was synthesized as above, and cleaved with 
ethylene diamine. HPLC purification as above. UV X ma x(e) : 



1275. ) 



Briefly, 



N - Boc - 4 - hydroxy - 2 - nap t hy 1 amine 



was 
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300 nm (24,975). 1 H NMR (DMSO-d 6 ) 5 10.47 (s, 1H) , 9.91 (s, 

1H) , 8.06 (m, 1H) , 7.67 (m, 1H) , 7.38 (s, 1H) , 7.25 (s, 

1H) , 7.15 (s, 1H) , 7.12 (s, 1H) , 7.02 (s, 1H) , 6.84 (s, 

1H) , 3.96 (s, 3H), 3.80 (s, 3H) , 3.76 (s, 3H) , 3.37 (m, 
5 2H) , 3.35 (m, 2H) , 2.83 (m, 2H) , 2.46 (m, 2H) . 

B . ImPyPy - (J - ED - succ in imi de - NHS 

ImPyPy-(i-ED (10 mg) was dissolved in 2 ml DMF added 100 
jil at a time to a solution of disuccinimidyl glutarate (100 
10 mg) and DIEA (10 |il) in 1 ml DMF at room temperature. The 
reaction was monitored by analytical HPLC and was complete 
within an hour after final addition of polyamide. 
Preparative HPLC gave a white powder. MS (FAB): 695.2 

(calculated 694.3). 1 H NMR (DMSO-d6) 5 10.49 (s, 1H) , 9.91 

15 (s, 1H) , 8.01 (m, 1H) , 7.92 (m, 1H) , 7.87 (m, 1H) , 7.38 (s, 

1H) , 7.25 (s, 1H), 7.17 (s, 1H) , 7.11 (s, 1H) , 7.04 (s, 

1H) , 6.78 (s, 1H), 3.96 (s, 3H) , 3.80 (s, 3H) , 3.76 (s, 

3H) , 3.33 (m, 2H) , 3.05 (m, 2H) , 2.77 (m, 2H) , 2.65 (m, 

2H) , 2.55 (m, 2H), 2.30 (m, 2H) , 2.28 (m, 2H) , 2.16 (m, 
20 2H) , 2.13 (m, 2H) , 1.78 (m, 2H) . 

C . Boc - (i - alanine - CBI 

Deprotect alcohol (217 mg, 0.65 mmol) as above. After 
removing ethyl acetate, dissolve in dry DMF (10 ml) . Add 
25 to Boc-p-alanine (245.98 mg, 1.3 mmol) and EDC (767 mg, 4 
mmol). Reaction was stirred under argon overnight. 

Solvent was removed in vacuo and precipitated in 20 ml of 
water. The precipitate was centrifuged, washed, with 
water, and lyophilized. Flash chromatography gave a yellow 

30 powder. 1 H NMR (DMSO-d6> 5 9.43 (br s, 1H) , 8.31 (s, 1H) , 



147 



WO 98/49142 



PCT/US98/06997 



8,29 (S, 1H), 7.65 (d, 1H) , 7.53 (t, 1H) , 7.40 (t, 1H) , 
5.56 (m, 1H) . 

D. CBI conjugated polyamide 

5 Boc-p-alanine CBI was deprotected as above. A solution 

of ImPyPy-p-ED-succinimide-NHS (20 mg) in 100 |il DMF was 
added with 10 ^1 DIEA. Reaction was stirred at room 
temperature under argon overnight. HPLC purification gave 

a white powder. MS (FAB) : 884.1 (calculated 884.4) 1 H NMR 
10 (DMSO-d 6 ) 8 10.50 (s, 1H) , 10.36 (br s, 1H) , 9.91 (s, 1H) , 
8.04 (m, 2H) , 7.92 (m, 2H) , 7.87 (m, 1H) , 7.81 (m, 1H) , 
7.73 (m, 2H) , 7.44 (t, 2H) , 7.41 ( S/ 1H) , 7.27-7.25 (m, 
3H) , 7.16 (S, 1H) , 7.10 (s, 1H) , 7.06 (s, 1H) . 

15 EXAMPLE 5 

Polyamide -Intercala tor Conjugates 

The artificial regulation of protein :DNA interactions 
is a potentially powerful therapeutic tool. Protein 
recognition of DNA, both specific and non-specif ic, is 

20 based heavily on the nearby DNA structure. (Luisi, B. 
(1995) in DNA-Protein: Structural Interactions, ed. Lilley, 
D. M. J. (IRL Press, Oxford), p. 23.) For example, a 

bent sequence of DNA may recruit a non-specific protein, as 
in HMG-I, or prevent a protein from making the appropriate 

25 contacts for high-affinity binding. Small molecules 
designed to bind predetermined sequences of DNA and 
modulate the local DNA topology may be a general approach 
for regulation of the function of DNA binding proteins. 

Intercalators are a class of molecules which are 

30 potent antibiotic and antitumor drugs. (Neidle and Abraham, 
(1984) CRC Crit. Rev. Biochem. 11, 73-121. Wang, A. H-J. 
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(1992) Curr. Opin. Struct. Biol. 2, 361-368.) Lerman first 
described intercalation as the insertion of a flat, 
aromatic chromophore between adjacent base pairs of the 
double helix. (Lerman, L. S. (1961) /. MoL Biol 3, 18-30.) The rise of 
5 B-form DNA is usually 3.4A/base pair. The stacking of the 
intercalator separates the adjacent base pairs by another 
3.4A and extends the length of the helix and equivalent 
amount per bound intercalator. The base pairs neighboring 
the intercalation site are also unwound 10-26° with respect 

10 to one another. Generally, it is these structural 

distortions introduced by intercalation which are 
considered to be the basis for their therapeutic activity. 
However, it is important to note that in most cases the DNA 
helix returns to its B-form structure within a few base 

15 pairs of the intercalation site. 

Due to their nature of stacking between the base 
pairs, intercalators generally exhibit little or no 
sequence specificity. A few natural products, such as 
actinomycin D and the anthracycline and pluramycin families 

20 of intercalators, have added f unctionalitites which impart 
preference for certain dinucleotide steps. (Hansen and 
Hurley (1996) Acc . Chem. Res. 29, 249-\.) Actinomycin D, 
consists of an aromatic phenoxazone core coupled to two 
identical cyclic pentapeptides that make contacts to the 

25 exocyclic amine of guanine, granting specificity for 
intercalation at 5'-GC-3' steps. Similarly, carbohydrate 
moieties attached to the chromophore of the anthracycline 
and pluramycin intercalators interact with the DNA bases in 
both the major and minor grooves and grant these molecules 

30 their sequence preferences. In almost all cases, the 
sequence specificity of these natural products is limited 
to the two base pairs adjacent to the intercalation site. 
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Netropsin and distamycin A are pyrrole carboxamide 
natural products which bind in the minor groove of DNA at 
sites of 4-5 contiguous A, T base pairs. (Krylov, et al . 

5 (1979) Nucleic Acids Res. 6, 289-304 ; Zasedatelev, et al . 
(1974) Mol. Biol. Rep. 1, 337-342; Zasedatelev, et al . 
(1976) Dokl. Akad. Nauk SSSR (1976) 231, 1006-1009; Zimmer, 
and Wanhert (1986) Prog. Biophys . Mol . Biol. 47, 31-112; 
Van Dyke, et al. (1982) Proc. Natl. Acad. Sci . , USA 19, 

10 5470-5474; Van Dyke and Dervan (1982) Cold Spring Harbor 
Symp. Quant. Biol. 47, 347-353; Van Dyke and Dervan, (1983) 
Biochemistry 47, 2373-2377; Harshman and Dervan (1985) 
Nucleic Acids Res. 13, 4825-4835; Fox and Waring, (1984) 
Nucleic Acids Res. 12, 9271-9285; Lane et al . (1983) Proc. 

15 Natl. Acad. Sci., USA 80, 3260-3264). In an attempt to 
create an intercalator with designed sequence specificity, 
a number of researchers have linked analogs of distamycin 
or netropsin to a non-specific intercalator. (Bailly and 
Henichart, (1991) Bioconj . Chem. 2, 379-393; Bourdouxhe- 

20 Housiaux, et al . (1996) Biochemistry 35, 4251-4264; Bailly, 
et al. (1994) Biochemistry 33, 15348-15364; Subra, et al. 
(1991) Biochemistry 30, 1642-1650; Eliadis, et al . (1988) 
J. Chem. Soc. Chem. Comm. 1049-1052; Wang, et al , (1994) 
Gene 149, 63-67; Arcamone, F. (1994) Gene 149, 57-61.) 

25 Although these efforts have met with some success, these 
compounds target mixed sequences of A*T and G*C base pairs. 
More specifically none of these compounds can bind a broad 
range of predetermined DNA sequences. Even more 
specifically, none of these compounds can bind a 

30 predetermined sequence with subnanomolar affinity. 

Linking a non-specific intercalator moiety to a 
polyamide may produce the sequence specific distortions of 
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DNA structure required to regulate protein : DMA 
interactions. Ethidium bromide is a common intercalator 
which has been shown to bind DNA with a Ka of approximately 
5 -1 

10 M and unwind the DNA helix by 26°. (LePecq and 
5 Paoletti (1967) J. Mol. Biol. 27, 87-106; Waring, M. (1970) 
J". Mol. Biol. 54, 247-279; Wang, J. C. (1974) J. Mol . Biol. 
89, 783-801; Bresloff and Crothers (1975) J. Mol. Biol. 95, 
103-123.) A derivative of ethidium, methidium, has been 
used previously in the preparation of designed 

10 intercalators and serves as the basis of methidium-propyl - 
Fe»EDTA (MPE) f ootprinting . (Dervan and Becker (1978) J. Am. 
Chem. Soc. 100, 1968-1970; Hertzberg and Dervan (1982) J. 
Am. Chem. Soc. 104, 313-315.) The synthesis and DNA- 
binding properties of a series of methidium-polyamide 

15 conjugates have been discovered by the present inventor. 

Methidium-polyamide conjugates are designed to 
sequence specifically induce helical unwinding and 
extension which may be sufficient to inhibit DNA binding by 
a wide variety of DNA binding proteins, such as the 

20 transcription factor, GCN-4, SP1, and NF-kB. 

A. Design and Synthesis of Methidium- Polyamide 
Intercalators 

A series of methidium-polyamide conjugates of the 
25 general design DMy-ImPyPy-y-ImPyPy-p-C n -Mdm (DMy=N, N- 
dimethyl-y-aminobutyric acid, C n =diamine linker of n 
carbons, Mdm=p-carboxymethidium) were synthesized. 
Polyamides generally contain a C-terminal positively 
charged dimethylaminopropyl amide. In this case, since the 
30 C-terminus is conjugated to the methidium, DMy was placed on 
the N-terminus to retain the net positive charge. This 
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alteration has no significant effect on polyamide binding. 
Boc-chemistry solid phase polyamide synthesis allows for 
the rapid preparation of milligram quantities' of purified 
polyamide suitable for methidium conjugation in solution. 

5 DMy-ImPyPy-y-ImPyPy-p-Pam-resin was prepared from Boc-Py-OBt 
ester and Boc-Im acid monomers. Aminolysis with various 
diamines (NH2(CH2) nNH2, n=2 , 4, 6) followed by preparatory 
HPLC purification afforded free polyamide with a C-terminal 
primary amine suitable for coupling to methidium. Reaction 

10 of the polyamide amine with the acylimidazole ester ofp- 
carboxy methidium and HPLC purification produced a series 
of methidium-polyamide conjugates with various linker 
lengths. The polyamide/methidium coupling reaction was 
quantitative by analytical HPLC. An average recovery of 

15 purified conjugate of 14.5% from DMy-ImPyPy-y-ImPyPy-p-Cn- 

NH2 was achieved. The 1 H NMR spectrum of each conjugate 
has resonances consistent with polyamide and methidium 
protons, as well as an additional broad triplet at 8.75 ppm 
resulting from the amide bond formed in the 

20 polyamide/methidium coupling reaction. MALDI-TOF mass 
spectrometry analysis of each conjugate reveals the 
presence of compounds consistent with the mass of the 
conjugated species, with no free polyamide or methidium 
observed. The synthesis of bifunctional methidium-polyamide 

25 conjugates is described in Figure 24. 

DMy-ImPyPy-Y-ImPyPy-P-Cn-Mdm conjugates are targeted 
to the 5'-TGACT-3' portion of the ARE and GCRE binding 
sites of GCN4 . By CPK modeling, intercalation is expected 
to occur between the two base pairs at the 3' end of the 

30 GCN4 biding site, AT for ARE ( 5 ' - CTGACTAAT - 3 ' ) and TT GCRE 
( 5 ' -ATGACTCTT-3 ' ) (intercalation site bolded) . Coupling of 
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the methidium (K a 10 M" ) and polyamide (K a 10 5 m" 1 ) 
moieties is also expected to produce a significant increase 
in binding affinity. Binding of a methidium-polyamide 
conjugate to a 5 ' -AGTGTA-3 ' site is depicted below. The 
5 methidium is represented as a grey rectangle and is placed 
between the base pairs where intercalation is predicted 
based on molecular modeling studies. 

5 ' -G C A TAAGTGTAAAGC C-3 ' 




3*-C G T ATTCACATTTCG G-5 ' 

10 

1 H NMR were recorded on a GE 300 instrument operating 
at 300MHz. Spectra were recorded in DMSO-d^ with chemical 
shifts reported in parts per million relative to residual 
DMSO- ds. UV spectra were measured on a Hewlett-Packard 

15 Model 8452A diode array spectrophotometer. Matrix- 
assisted, laser desorption/ionization time of flight mass 
spectrometry was carried out at the Protein and Peptide 
Microanalytical Facility at the California Institute of 
Technology. HPLC analysis was performed either on a HP 

20 1090 M analytical HPLC or a Beckman Gold system using a 
Rainen C18, Microsorb MV, 5 /im, 300 x 4.6 mm reversed 
phase column in 0.1% (wt/v) TFA with acetonitrile as eluent 
and a flow rate of 1.0 ml/min, gradient elution 1.25% 
acetonitrile/min. Preparatory HPLC was carried out on a 

25 Beckman instrument using a Waters DeltaPak 2 5 x 100 mm 100 
Cis column in 0.1% (wt/v) TFA, gradient elution 
0.25%/min. CH3CN. Water was obtained from a Millipore 
Milli-Q water purification system. Reagent-grade chemicals 
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were used unless otherwise stated. Restriction 
endonucleases were purchased from either New England 
Biolabs or Boeringher-Mannheim and used according to the 
manufacturer ■ s protocol . Sequenase (version 2.0) was 
5 obtained from United States Biochemical, and DNase I 

(FPLCpure) was obtained from Boeringher-Mannheim. [a- 32 P] - 
Thymidine-5 ' -triphosphate (3000 Ci/mmol) , and [a- 32 P] - 
deoxyadenosine-5 • -triphosphate (6000 Ci/mmol), were 
purchased from Du Pont/NEN. 

10 

B. Synthesis of Polyamide-Methidium Conjugates 

Boc-Im acid and Boc-Py-OBt were synthesized in 5 and 6 
steps, respectively. DMy-ImPyPy-y-ImPyPy-p-Pam- resin was 
prepared using Boc-chemistry manual solid phase synthesis 

15 protocols. Polyamide was cleaved from the resin (400 mg) 
by aminolysis in neat diamine (2 mL, 24-48 hours, 60° C) 
and purified by preparative HPLC. p-Carboxy methidium acid 
(50 mg) in DMSO (1 mL) was activated by reaction with 
- carbonyl diimidazole (22 mg) and N-ethylmorpholine (15 |iL) 

20 in DMSO (200 nL) (25° C, 1 hour) . Aliquots of this solution 
(375 nL) and DIEA (150 ^L) were added to DMy-ImPyPy-y- 
ImPyPy-p-C n -NH2 (n=2, 4, 6) polyamides, each in DMSO (150 \x 
L) . After 12-24 hours the reaction was diluted with 0.1% 
(wt/v) TFA (5 mL) and purified by HPLC. 

25 

C. DMy - ImPy Py - y - ImPy Py - P - C 2 - Mdm 

Coupling of p-carboxy methidium acid acid to DMy- 
ImPyPy-y-ImPyPy-p-C2-NH 2 (20 mg, 19 ^imol) afforded DMy- 
ImPyPy-y-lmPyPy-p-C 2 "Mdm as a purple powder. (3.0 mg, 2 \x 
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mol, 10.5% recovery). HPLC r.t. 28.9, UV A. max (e) , 290 

(93,000); 1 H NMR (DMSO-d 6 ); d 10.38 (s, 1 H) , 10.25 (s, 1 
H), 9.99 (s, 1 H), 9.95 (s, 1 H) , 9.89 (s, 2 H) , 9.4 (br s, 
1 H) , 8.80 (t, 1 H) , 8.65 (t, 2 H) , 8.18 (d, 2 H) , 8.08 
5 (m, 3 H) , 7.73 (d, 2 H) , 7.48 (d, 1 H) , 7.42 (s, 1 H) , 
7.41 (s, 1 H), 7.30 (d, 2 H) , 7.23 (s, 2 H) , 7.15 (m, 2 H) , 
6.86 (s, 1 H), 6.82 (s, 1 H) , 6.30 (d, 1 H) , 3.93 (s, 9 H) , 
3.82 (s, 3 H), 3.80 (s, 3 H) , 3.78 (s, 6 H) , 3.38 (m, 2 H) , 
3.18 (m, 2 H), 3.08 (m, 4 H) , 2.76 (d, 6 H) , 2.35 (m, 9 H) , 
10 1.92 (m, 2 H) , 1.76 (m, 2 H) . MALDI-TOF MS 1390.8 (1390.6 
calc. for M+H) . 

D. DMy-ImPyPy-y-ImPyPy-p-C4-Mdm 

Coupling of p-carboxy methidium acid acid to DMy- 
15 ImPyPy-y-lmPyPy-p-C 4 -NH2 (35 mg, 32 umol) afforded DMy- 
ImPyPy-y-lmPyPy-P-C4-Mdm as a purple powder. (8.7 mg, 6 u 
mol, 19% recovery). HPLC r.t. 29.7, UV X. max (e) , 290 

(93,000); 1 H NMR (DMSO-d 6 ) ; d 10.38 (s, 1 H) , 10.25 (s, 1 
H), 9.97 (s, 1 H) , 9.95 (s, 1 H) , 9.90 (s, 2 H) , 9.4 (br s, 

20 1 H), 8.74 (t, 1 H), 8.61 (t, 2 H) , 8.17 (d, 2 H) , 8.02 
(m, 2 H), 7.95 (t, 1 H) , 7.73 (d, 2 H) , 7.48 (d, 1 H) , 
7.43 (s, 1 H), 7.42 (s, 1 H) , 7.30 (d, 2 H) , 7.23 (s, 2 H) , 
7.15 (m, 2 H), 6.87 (s, 1 H) , 6.82 (s, 1 H) , 6.30 (d, 1 H) , 
3.93 (m, 6 H), 3.92 (s, 3 H) , 3.82 (s, 3 H) , 3.81 (s, 3 H) , 

25 3.77 (s, 6 H) , 3.38 (m, 2 H) , 3.18 (m, 2 H) , 3.08 (m, 4 H) , 
2.76 (d, 6 H), 2.40 (m, 9 H) , 1.92 (m, 2 H) , 1.78 (m, 2 H) , 
1.50 (m, 4 H) . MALDI-TOF MS 1418.7 (1418.6 calc. for M+H). 
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E k DMy - ImPy Py - y - ImPy Py - P - C 6 - Mdm 

Coupling of p-carboxy methidium acid acid to DMy- 
ImPyPy-y-ImPyPy-P-C 6 -NH2 (30 mg, 27 jimol) afforded DMy- 
ImPyPy-y-ImPyPy-p-Cg-Mdm as a purple powder. (5.3 mg, 3.7 ju 
5 mol, 14% recovery). HPLC r.t. 30.6, UV A, m ax (e) , 290 

(93, 000); X H NMR (DMSO-d 6 ) ; d 10.38 (s, 1 H) , 10.26 (s, 1 
H) , 9.98 (s, 1 H) , 9.95 (s, 1 H) , 9.90 (s, 2 H) , 9.4 (br s, 
1 H), 8.73 (t, 1 H), 8.61 (t, 2 H) , 8.17 (d, 2 H) , 8.02 
(m, 2 H) , 7.89 (t, 1 H) , 7.73 (d, 2 H) , 7.48 (d, 1 H) , 

10 7.44 (s, 1 H) , 7.43 (s, 1 H) , 7.30 (d, 2 H) , 7.24 ( S/ 2 H) , 
7.14 (m, 2 H) , 6.88 (s, 1 H) , 6.83 (s, 1 H) , 6.31 (d, 1 H) , 
3.93 (m, 9 H), 3.83 (m, 6 H) , 3.78 ( S/ 6 H) , 3.31 (m, 2 H) , 
3.18 (m, 2 H), 3.03 (m, 4 H) , 2.76 (d, 6 H) , 2.35 (m, 9 H) , 
1.92 (m, 2 H) , 1.78 (m, 2 H) , 1.50 (m, 6 H) . MALDI-TOF MS 

15 1446.7 (1446.9 calc. for M+H) . 



EXAMPLE 6 
Polyamide Dye Conjugates 

Solution methods for the sequence-specific detection 
20 of nucleic acids offer several advantages in terms of 
sample preparation and of time resolution of measurements. 
Currently most efforts in this direction focus on 
hybridization methods of single stranded targets. The 
targeting of double helical DNA allow for the direct 
25 detection of biological DNA samples including plasmid, 
cosmid, or genomic DNA. DNA-binding pyrrole- imidazole 
polyamide will sequence -spec if ically deliver 

environmentally sensitive f luorochromes to the DNA. Several 
dyes show a markedly increased fluorescence upon binding to 
30 DNA, among these are Hoechst 33258, ethidium bromide, and 



156 





WO 98/49142 



PCT/US98/06997 



most notably thiazole orange. More generally, dyes such as 
dansyl and mansyl show tremendous sensitivity to 
environment . 

Conjugates have been prepared with a number of such 
5 dyes in order to develop sequence-specific, high affinity 
DNA f luorochromes . The polyamide portion of each dye was 
prepared using solid phase synthetic methodology and 
reacted with an amine reactive f luorochrome . A number of 
dyes and 1 linker diamines' are being investigated. These 

10 conjugates are unique, in that they combine the ability to 
recognize any predetermined DNA sequence with the ability 
to signal binding events directly. 

The synthesis of a polyamide -rhodamine conjugate is 
outlined below. [(i) 80% TFA/DCM, 0.4M PhSH; (ii) BocPy- 

15 OBt, DIEA, DMF; (iii) 80% TFA/DCM, 0.4M PhSH; (iv) BocPy- 
OBt, DIEA, DMF; (v) 80% TFA/DCM, 0 . 4M PhSH; (vi) BocPy-OBt, 
DIEA, DMF; (vii) 80% TFA/DCM, 0 . 4M PhSH; (viii) Boc-y- 
aminobutyric acid-Im-OBt, HBTU, DIEA, DMF; (ix) 80% 
TFA/DCM, 0.4M PhSH; (x) BocPy-OBt, DIEA, DMF; (xi) 80% 

20 TFA/DCM, 0.4M PhSH; (xii) BocPy-OBt, DIEA, DMF; (xiii) 80% 
TFA/DCM, 0.4M PhSH; (xiv) BocPy-OBt, DIEA, DMF; (xv) 80% 
TFA/DCM, 0.4M PhSH; (xvi) Imidazole-2 -carboxylic acid 
(HBTU/DIEA) ; (xvii) diamino-JV-methyldipropylamine , 55 °C; 
(xviii) 5-carboxyrhodamine 6G succinimidyl ester, 20mM 

25 HEPES, pH 7.5, 25 °C] (Figure 25). The chemical structures 
of a number of polyamide -DYE conjugates are shown in 
Figures 26A-D. 



Systems which show enhanced or specific fluorescence 
upon binding to a specific DNA sequence could be useful 



EXAMPLE #7 



30 



DNA Detection Through Energy Transfer 
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reagents for genomic analysis. Energy transfer between Dyes 
provides a means of detecting simultaneous binding of 
sequence-specific imidazole -pyrrole polyamides " to proximal 
DNA binding sites . (Ju, et al. Proc. Natl Acad. Sri. 92, 4347-4351; Nie et al., 
5 Science 266, 1018-1021.) Two DNA binding polyamides will be 
prepared to target adjacent DNA binding site, one 
conjugated to a donor dye, the other conjugated to an 
acceptor dye. Dye pairs will be chosen such that the donor 
dye can be excited without exciting the acceptor. With 

10 excitation at this energy, fluorescence of the acceptor 
fluorochrome will only occur while proximal to the donor 
fluorochrome through energy transfer from the donor. The 
required binding of the two polyamides will lengthen the 
effective recognition sequence to the levels appropriate 

15 for genomic level analysis and will improve the specificity 
of the technique. 

Using dye conjugation chemistry developed by the 
present inventor, conjugates will be prepared purified, and 
characterized. Donor-acceptor pairs such as fluorescein- 

20 rhodamine or thiazole-orange/rhodamine will be analyzed for 
their computability in this system. This energy transfer 
system increases the currently accessible recognition 
sequence for polyamides and provides for a unique binding - 
dependent signal, applicable for both homogeneous and 

25 heterogeneous detection systems. 

Pyrene and similar systems for excimers (excited state 
dimers) provide two or more molecules are close in three 
dimensional space. DNA-binding polyamides deliver pyrene to 
proximal positions on DNA. Binding is then monitored by the 

30 formation of the excimer. 
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The structure of a pyrene polyamide conjugate is shown 
below. 




Pyrene-PyPyPy-p-lmPyPy-Y-lmPyPy-p-Dp 

5 EXAMPLE 8 

Polyamide Biotin Conjugates 

Conjugates prepared between sequence specific DNA 
binding polyamides and biotin are useful for a variety of 
applications. First, such compounds can be readily attached 

10 to a variety of matrices through the strong interaction of 
biotin with the protein streptavidin. (Weber, P.C., Ohlendorf, 
D.H., Wendoloski, J J., Salemme, F.R. Science 243, 85-88) Readily available 
strepdavidin-derivatized matrices include magnetic beads 
for separations as well as resins for chromatography. (Ito,T., 

15 Smith, C.L., Cantor, C.R. Proc. Natl Acad. Sri. 89, 495-498; Tagle, D.A., Swaroop, 
M., Lovett, M., Collins, F.S. Nature 361, 751-753) 

A number of such polyamide -biotin conjugates have been 
synthesized by solid phase synthetic methods. Following 
resin cleavage with a variety of diamines, the polyamides 

20 were reacted with carious biotin carboxylic acid 
derivatives to yield conjugates. The conjugates were 
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purified by HPLC and characterized by MALDI-TOF mass 
spectroscopy and 1H NMR. The synthesis of biotin-polyamide 
conjugate is shown in Figure 27. The chemical structure of 
a number of bifunctional biotin-polyamide conjugates 
5 prepared by the present invention are shown in Figures 2 8A- 
D. A scheme for sequence specific affinity capture by a 
bifunctional polyamide-biotin conjugates is outlined in 
Figure 29. 



10 EXAMPLE 9 

Photoactivated Modification of DNA By A Polyamide- Psoralen 

Conjugate 

Photoactivated modification of DNA by a polyamide- 
psoralen conjugate. Psoralen and psoralen derivatives have 

15 been used as photoactive drugs in the treatment of cancer. 
(Edelson, et al . N. Engl. J. Med. 316. 297 (1987)) These 
molecules intercalate into double-helical DNA and upon 
irradiation with UVA undergo a [2+2] cycloaddition reaction 
with the 5,6 double bond of thymine residues to form both 

20 monoadducts and interstrand DNA cross-links. (Psoralen DNA 
Photobiology , Volumes 1 and 2; Gesparro, F.P., Ed. CRC 
Press, Inc., Boca Raton, Fl . 1988.) Our recent interests 
have focused on the synthesis and in vitro analysis of 
photoactive polyamide-psoralen conjugute B which is 

25 designed to form covalent attachments to DNA in a sequence- 
specific manner. The use of light as a trigger for the 
permanent covalent modification of DNA may prove to be 
attractive tool for potential in vivo applications such as 
the specific inhibition of transcription by minor groove 

30 binding polyamides. The extended hairpin polyamide-psoralen 
conjugate B was synthesized by coupling the OBt ester of 5- 
(8-psoralenyloxy)pentanoic acid. (Lee, et al., <J. Med. 
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Chem. 1994, 37, 1208.) to the extended hairpin polyamide 
directly on the P-alanine-Pam resin. Upon equilibration of 
the psoralen-polyamide conjugate at pH 7.5 with a 247 bp 
restriction fragment followed by irradiation at 360 nm, the 

5 extent of intrastrand cross-link formation was shown to be 
between 15-20% and 54-57% at 10 nM and 100 nM 
concentrations of polyamide respectively. Our current work 
involves the use of a polymerase stop assay as a tool to 
map the sites of intrastrand covalent modification as well 

10 as sites of potential monoadduct formation on double - 
helical DNA. The structure of the psoralen-polyamide 
conjugate is shown in Figure 30. 



EXAMPLE 10 

15 In vitro assay for polyamide binding 

An engineered, radiolabeled restriction fragment from 
pUC-19 was prepared in which a nine bp polyamide binding 
site overlaps by two base pairs with the cleavage site for 
the restriction endonuclease Pvu II. Cleavage by Pvu II is 

20 prevented when the overlapping polyamide binding site is 
occupied by the polyamide. As a control, a second 
radiolabeled DNA fragment was prepared which contains a Pvu 
II site, but lacks the overlapping polyamide binding site. 

The rate of polyamide association with its target 

25 binding site was assessed by combining solutions of the 
polyamide with the radiolabeled target and reference 
fragments and allowing them to for 5 minutes to 5 hours 
before initiating a treatment (1-2 minutes) with the enzyme 
Pvu II. Under the experimental conditions, the reference 

30 site is nearly completely digested, but protection at the 
target site is observed and can be correlated with 
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polyamide concentration and the time of equilibration. 
Similarly, the dissociation rate is analyzed by adding an 
excess of unlabeled competitor DNA to an equilibrated 
solution of the labeled DNA fragments and polyamide. 

5 Addition of the competitor reduces the concentration of 
free polyamide to zero. The rate at with polyamide 
dissociation occurs from the target site on the labeled 
fragment can be followed by the rate of loss of protection 
from Pvu II digestion as the re-equilibration time is 

10 increased. The association profile with respect to time 
for the 9 -ring extended hairpin polyamide ImPyPy-y-lmPyPy-p- 
PyPyPy-G-Dp binding its cognate 9 base pair match site is 
shown in Figure 6 . 



15 determined to be ka = 3.1 ±0.2 x 10*" 5 s*" 1 , this corresponds 
to a half-time of 6.2 h (10 mM Bis-Tris (pH 7.0), 50 mM 
NaCl, 5 mM MgCl2 , 1 mM mercaptoethanol at 37 °C) . Wherein 
half-time is defined as the time required for 50% of a 
population of DNA and polyamide to dissociate or associate. 

20 The association rate has been determined at Jc a = 1.3 ±0.8 x 

10 4 M-^-s" 1 ; this corresponds to a half-time of 3 . 0 h at 5.0 
nM. The determined value for th$ equilibrium association 
constant (K^q = 6.3 ±0.8 x 10 8 M _1 ) correlates well with 
the kinetically determined ratio (k a /kd = 4.2 ± 2.6 x 10 8 
25 M" 1 ) . 

These results demonstrate that polyamides bind to a 
designated target site within seconds to minutes, but that 
it may take hours for dissociation to occur at such a site. 
More specifically these results demonstrate that polyamides 
30 bind DNA with a combination of association and dissociation 



The extended hairpin dissociation rate has been 
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rates which provide effective modulation of the activity of 
DNA binding proteins. 



5 Cooperative Hairpin Dimers for Recognition of DNA by Py-Im 



Small molecules which permeate cells and bind 
predetermined DNA sequences have the potential to control 
the expression of specific genes. Trauger, et al . Nature 

10 1996, 382, 559-561; Gottesfeld, et al . Nature 1997, 387, 
202-205) . Recently, an eight-ring polyamide which binds to 
a six base pair target site was shown to inhibit gene 
transcription in cell culture (Gottesfeld, et al . Nature 
1997, 357, 202-205) . Polyamides recognizing longer DNA 

15 sequences should provide more specific biological activity 
(P. B. Dervan, Science 1986, 232, 464) which could be 
achieved by synthesizing larger hairpins (Turner, et al . 
EJ. Am. Chem. Soc . 1997, 119, 7636-7644). However, the 
upper limit of polyamide size with regard to efficient cell 

20 permeation is not known. 

Alternatively, a more biomimetic approach is to bind 
larger DNA sequences while maintaining the size of the 
polyamide. Nature's transcription factors often bind large 
DNA sequences by formation of cooperative protein dimers at 

25 adjacent half -sites (Ptashne, et al . A Genetic Switch, 
Blackwell Scientific Publications and Cell Press: Palo 
Alto, CA, 1986; Pabo, et al . Ann. Rev. Biochem. 1992, 61, 
1053-1095; Ma rmor stein, et al. Nature 1992, 356, 408-414; 
Klemm, et al . Cell 1994, 77, 21-32; Bellon, et al . Nature 

30 Struct. Biol. 1997, 4, 586-591). For cooperatively binding 
extended Py-Im polyamide dimers, the two ligands can slip 
sideways with respect to one another, allowing recognition 



EXAMPLE 11 



Polyamides 



163 



WO 98/49142 



PCT/US98/06997 



of other sequences (Trauger, et al . J. Am. Chem. Soc. 1996, 

118, 6160-6166; Swalley, et al . Chem. Eur. J. 1997, 3, 
1600-1607) . Hairpin polyamides utilizing the turn-specific 
y-aminobutyric acid linker are constrained to be fully 

5 overlapped and preclude the "slipped motif" option 
(Mrksich, et al . J\ Am. Chem. Soc. 1994, 116, 7983-7988; 
Parks, et al . ibid. 1996, 118, 6153-6159; Swalley, et al . 
iMd. 1996, 118, 8198-8206; Swalley, et al . ibid. 1997, 

119, 6953-6961; Trauger, et al . Chem. & Biol. 1996, 3, 369- 
10 377; Declairac, et al . , J. Am. Chem. Soc. 1997, 119, 7909- 

7916) . Provided herein is a cooperative six-ring extended 
hairpin polyamide which dimerizes to specifically bind a 
predetermined ten base pair sequence. 

A sequence contained in the regulatory region of the 

15 HIV-l genome was selected as the target site (Jones, et al . 
Ann. Rev. Biochem. 1994, 63, 717-7 '43; Freeh, et al . 
Virology 1996, 224, 256-267) . To design the ligand, the 
polyamide ring pairing rules provided herein, such as the 
inclusion of p-alanine (p) to relax ligand curvature, and 

20 the preference of y-aminobutyric acid (y) for a "hairpin 
turn" conformation within polyamide-DNA complexes were 
considered. This analysis suggested that the six-ring 
polyamide having the core sequence ImPy- (J-lmPy- y-ImPy might 
bind the target sequence 5 ' - AGCAGCTGCT- 3 ' through formation 

25 of a cooperative hairpin dimer (Figure 31) . To avoid a 
collision between the N-terminal end of one ligand and the 
C- terminal end of the second within the complex, the 
positively-charged P~ alanine- dime thylaminoproplyamide C- 
terminus used in standard polyamides has been replaced with 

30 the shorter, uncharged (CH 2 ) 2 OH group (C 2 -0H) . The cationic 
"turn" residue {R) -2 , 4-diaminobutyric acid ((£) H2N y) 
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maintains the overall +1 charge for optimal solubility in 
water. 

Polyamide ImPy- p-ImPy- (R) H2N y- ImPy-C 2 -OH was 

synthesized using solid-phase methods (E. E. Baird, P. B. 
5 Dervan, J. Am. Chem. Soc. 1996, 118, 6141-6146) on glycine- 
PAM resin (available in 0.3 mmol/g substitution from 
Peptides International, Louisville, KY) , reductively 
cleaved from the solid support using LiBH 4 (Mitchell, et 
al., J . Org. Chem. 1978, 43, 2845; Stewart, et al . Solid 

10 Phase Peptide Synthesis, Pierce Chemical Company, Rockford, 
IL, 1984) and purified by HPLC (reverse-phase) . The 
identity and purity of the polyamides was confirmed by X H 
NMR, analytical HPLC, and MALDI-TOF MS. MALDI-TOF MS 
(monoisotopic) (M+H) : ImPy- P-ImPy- (R) H2N y- ImPy-C 2 -OH, obsd 

15 953.3, calcd (C 42 H 53 N 18 0 9 ) 953.4/ ImPy-|J- ImPyPy- (R) H2N y-PyImPy- 
C 2 -0H, obsd 1197.5, calcd (Cs^s^Ou) 1197.5. 

Figure 32 illustrates the (ImPy- p- ImPy- (R) H2N y-ImPy-C 2 - 
OH) 2 • 5 ' -AGCAGCTGCT- 3 ' complex, demonstrating binding 
models for complexes of a 10 base pair match and single- 

20 base pair mismatch sites (the mismatched base pair is 
highlighted by shading) . The shaded and open circles 
represent imidazole and pyrrole rings, respectively, 
diamonds represent p-alanine, half -circles represent 
(CH 2 ) 2 OH groups, and curved lines represent (R)-2,4- 

25 diaminobutyric acid. 

Figure 33a represents a storage phosphor autoradiogram 
of the 8% denaturing polyacrylamide gel used to separate 
the fragments generated by DNase I digestion in a 
quantitative footprint titration experiment with polyamide 

30 ImPy- p-ImPy- (J?) H2N y-ImPy-C 2 -OH: lane 1, A lane; lane 2, 
DNase I digestion products obtained in the absence of 



165 



WO 98/49142 



PCT/US98/06997 



polyamide; lanes 3-12, DNase I digestion products obtained 
in the presence of 0.1, 0.2, 0.5, 1, 2, 5, 10, 20, 50, and 
100 nM polyamide ImPy- P-ImPy- (R) H2N y-ImPy-C 2 -OH, 

respectively. All reactions contain pJT-LTR 3'- 32 P-end- 
5 labeled EcoRl/Hindlll restriction fragment (15 kcpm) , 10 mM 
Tris*HCl, 10 mM KCl, 10 mM MgCl 2 , and 5 mM CaCl 2 (pH 7.0, 24 
°C) . Plasmid pJT-LTR was prepared by ligating an insert 
having the sequence 5 ' -CCGGTAACCAGAGAGACCCAGTACAGGCAA- 
AAAGCAGCTGCTTATATGCAGCATCTGAGGGACGCCACTCCCCAGTCCCGCCCAGGCCA 

10 CGCCTCCCTGGAAAGTCCCCAGCGGAAAGTCCCTTGTAGAAAGCTCGATGTCAGCAGTC 
TTTGTAGTACTCCGGATGCAGCTCTCGGGCCACGTGATGAAATGCTAGGCGGCTGTCAA 
TCGA-3' to the large Aval/ Sail fragment of pUC19. 

Quantitative DNase I footprinting on a 245 base pair 
3 ' - 32 P-end-labeled restriction fragment showed that ImPy- P- 

15 ImPy- (jR) H2N y-ImPy-C 2 -OH binds its match site 5 ' -AGCAGCTGCT-3 ' 
at nanomolar concentrations (apparent monomeric association 
constant, K a = 1.9 (±0.3) x 10 8 M' 1 ) , and also binds a 
single-base pair mismatch site 5 ' -AGATGCTGCA-3 ' with 9-fold 
lower affinity, K a = 2.2 (±0.5) x 10 7 M" 1 (Figure 33b). 

20 The binding data for match and single -base pair 

mismatch sites were well -fit by cooperative binding 
isotherms, consistent with formation of cooperative 2:1 
polyamide-DNA complexes. 151 A double-base pair mismatch 
site, 5' -AGCTGCATCC-3' , is also bound with 65-fold lower 

25 affinity. The fact that this mismatch site, which contains 
the "half-site" 5 ' -AGCTGCA-3 ' , is not effectively bound 
indicates that recognition of the match site occurs through 
cooperative dimerization, and not due to formation of 1:1 
hairpin complexes. 

30 Further study of the generality and sequence 

specificity of this motif is in progress and will be 
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reported in due course. For example, we found that the 
eight-ring polyamide ImPy-p- ImPyPy- (R) H2N y-PyImPy-C 2 -OH binds 
the twelve base pair match site 5 ' -AAGCAGCTGCTT-3 ' with 10- 
fold higher affinity than ImPy- p-ImPy- (R) H2N y- ImPy-C 2 -OH, 
5 and is approximately 100-fold specific for this site versus 
the double-base pair mismatch site 5' -CAGATGCTGCAT-3 ' 

The DNA-binding affinity and specificity of the six- 
ring polyamide ImPy- p-ImPy- (R) H2N y-ImPy-C 2 -OH for its ten 
base pair binding site are typical of standard six-ring 

10 hairpins which recognize five base pairs. Thus, use of a 
the cooperative hairpin dimer motif doubles the binding 
site size relative to the standard hairpin motif without 
sacrificing affinity or specificity, and without increasing 
the molecular weight of the ligand. As provided herein, a 

15 novel cooperative hairpin dimer motif, relatively low 
molecular weight pyrrole-imidazole polyamides (MW 
approximately 950-1,200) can specifically recognize 10-12 
base pairs of DNA. 

Figure 34 provides general polyamide motifs for use in 

20 desigining polyamides having improved binding and 
specificity. Figure 35 provides five general formulas for 
polyamides of the present invention. Figure 36 illustrates 
the DNA footprint analysis and affinities of additional 
cooperatively-bound polyamides. Figure 37 demonstrates the 

25 N-terminal extension of the polyamide ImPyPy-X-ImlmPy-y- 
PyPyPy-p-Dp where X is y, C5-8, p- P, or P-C5. 



167 



WO 98/49142 PCT/US98/06997 

Table 5 illustrates recognition of 15 Base-Pairs by 
ImPyPy-x-ImlmPy-y-PyPyPy-p-Dp polyamides. Association 
constants (K a ) for the match site 5 ' -AACCAAGTCTTGGTA-3 1 and 
specificities for the match site versus center (5>- 
AACCAACTGTTGGTA-3 1 ) and edge (5 ' -AACCAAGTCTTGCGA-3 1 ) 
mismatch sites are also illustrated. 

TAB LB 5 



X = 



Length of X K, (M ') Specificity 



Y 


5 


7 

1 x 10 


1 


Edge mismatch 
1 


C5 


6 


8 

4x10 


11 


19 


C6 


7 


8 

2x10 


12 


15 


C7 


8 


8 

2x 10 


2 


2 


P-P 


8 


8 

3x 10 


2 


2 


C8 


9 


8 

2x10 


10 


10 


P-C5 


10 


8 

1 x 10 


2 


2 



10 



15 



Solution conditions: 10 mM Tris*HCl, 10 mM KC1, 10 mM MqCl 2 
and 5 mM CaCl 2 at 24 <> c and pH 7.0. The parent hairpin Ac- 
ImlmPy-y-PyPyPy-p-Dp binds both 5 base pair binding sites 
within the 15 base pair target site 5 ' -AACCAAGTCTTGGTA- 3 ' 
with K a = 3 x 10 7 M" 1 . 
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EXAMPLE 12 

Recognition of 16 Base Pairs in the Minor Groove of DNA 
by an Im-Py Poly amide Dimer 

Cell -permeable small molecules which bind 
predetermined DNA sequences with affinity and specificity 
comparable to natural DNA-binding proteins have the 
potential to regulate the expression of specific genes. 
Recently, an 8-ring hairpin Py-Im polyamide which binds 6 
base pairs of DNA was shown to inhibit transcription of a 
specific gene in cell culture (Gottesf eld, et al. Mature 
1997, 387, 202-205). Polyamides recognizing longer DNA 
sequences should provide more specific biological 
activity. To specify a single site within the 3 billion 
base pair human genome, ligands which specifically 
recognize 15-16 base pairs are necessary. For this 
reason, recognition of 16 base pairs represents a 
milestone in the development of chemical approaches to 
DNA recognition (Dervan, P.B. Science 1986, 232, 464; 
Dervan, P.B. In The Robert A. Welch Foundation 

Conference on Chemical Research XXXI. Design of Enzymes 
and Enzyme Models; Houston, Texas, November 2-4, 1987; pp 
93-109; Dervan, P.B. In Nucleic Acids and Molecular 
Biology, Vol. 2; Springer-Verlag : Heidelberg, 1988; pp 
49-64; Moser, et al . Science 1987, 238, 645-650; Le Doan, 
et al. Nucleic Acids Res. 1987, 15, 7749; Strobel, et al . 
Science 1991, 254, 1639-1642; Thuong, et al . Angew. Chem. 
Int. Ed. Engl. 1993,. 32, 666-690). A Py-Im polyamide 
dimer which targets 16 contiguous base pairs in the minor 
groove of DNA is provided herein. 

As the length of a polyamide dimer having the 
general sequence ImPy 2 - 6 increases beyond 5 rings 
(corresponding to a 7 base pair binding site) , the DNA- 
binding affinity ceases to increase with polyamide length 
(Kelly, et al . Proc. Natl. Acad. Sci . U.S.A. 1996, 93, 
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6981-6985) . A structural basis for this observation is 
provided by the recently determined X-ray crystal 
structure structure of a 4 -ring homodimer in complex with 
DNA, which reveals a perfect match of polyamide rise-per- 
residue with the pitch of the DNA duplex, but overwound 
ligand curvature (Keilkopf, et al . Nature Struct. BioJ . 
1998, 5, 104-109). The curvature mismatch explains the 
observation that flexible p-alanine residues reset an 
optimum fit of polyamide dimers with the DNA helix at 
long binding sites (Trauger, et al . J. Am. Chem. Soc. 

1996, 218, 6160-6166; Swalley, et al . Chem. Eur. J. 

1997, 3, 1600-1607). 

The 16 base pair sequence 5 ' -ATAAGCAGCTGCTTTT-3 ' 
present in the regulatory region of the HIV-1 genome was 
utilized as a binding site (Jones, et al . Ann. Rev. 
Biochem. 1994, 63, 717-743; Freeh, et al . Virology 1996, 
224, 256-267) . Consideration of the previously published 
polyamide ring pairing rules (Wade, et al . J. Am. Chem. 
Soc. 1992, 114, 8783-8794; Mrksich, et al . Proc. Natl. 
Acad. Sci. 1993, 32, 11385-11389; Geierstanger , et al. J. 
Am. Chem. Soc. 1993, 115, 4474-4482; White, et al . Chem. 
& Biol. 1997, 4, 569-578; Pelton, et al . Proc. Natl. 
Acad. Sci. U.S.A. 1989, 86, 5723-5727 ; Pelton, et al. J. 
Am. Chem. Soc. 1990, 112, 1393-1399; Chen, et al. Nature 
Struct. Biol. 1994, 1, 169-175; White, et al . 
Biochemistry 1996, 35, 12532-12537), the A, T specificity 
of p/p pairs, and the "slipped" dimer motif 
(Geierstanger, et al . Nature Struct. Biol. 1996, 3, 321- 
324; Trauger, et al . Chem. & Biol. 1996, 3, 369-377) 
suggested that the 8-ring polyamide ImPy-P-ImPy-p-ImPy-p- 
PyPy-p-Dp (1) would specifically bind the target sequence 
as a cooperative antiparallel dimer (Fig. 38) . 
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Figure 38 illustrates a model of the complex of 
ImPy-p-ImPy-p-ImPy-p-PyPy-p-Dp (1, R = H) or ImPy-p-ImPy- 
P-ImPy-p-PyPy-p-Dp-EDTA*Fe(II) (1-E, R = R E ) (im = N- 
methylimidazole, Py = N-methylpyrrole, P = p-alanine, Dp 
= dimethyl ami nopropyl amide) with 5 ' -ATAAGCAGCTGCTTTT-3 ' . 
The shaded and open circles represent imidazole and 
pyrrole rings, respectively, and the diamonds represents 
P-alanine. Circles with dots represent lone pairs on N3 
of purines and 02 of pyrimidines, and circles containing 
an H represent the N2 hydrogen of guanine. Putative 
hydrogen bonds are illustrated by dashed lines. The 
polyamides were synthesized using solid-phase methods 
(Baird, et al . J. Am. Chem. Soc. 1996, 118, 6141-6146), 
purified by HPLC, and the identity and purity confirmed 
by X H NMR, analytical HPLC and MALDI-TOF MS. 

A quantitative DNase I footprinting experiment 
carried out on a 245 base pair 3 ' - 32 P-end-labeled 
restriction fragment revealed that the polyamide 
specifically binds it target site at subnanomolar 
concentrations (apparent monomeric association constant, 
Ka 3.5 x 10 10 NT 1 ) (Fig. 39) (Baird, et al . J. Am. Chem. 
Soc. 1996, 118, 6141-6146; Brenowitz, et al . Methods 
Enzymol. 1986, 130, 132-181; Cantor, C.R.; Schimmel, 
P.R., Biophysical Chemistry, Part III: The Behavior of 
Biological Macromolecules; W.H. Freeman, New York, N.Y., 
1980, p 863) . 

Figure 3 9 illustrates a storage phosphor 
autoradiogram of an 8% denaturing polyacryl amide gel used 
to separate the fragments generated by DNase I digestion 
in a quantitative footprint titration experiment with 
polyamide ImPy-p- ImPy-p-IrnPy-p-PyPy-p-Dp (1) : lane 1, A 
lane; lane 2, DNase I digestion products obtained in the 
absence of polyamide; lanes 3-10, DNase I digestion 
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products obtained in the presence of 0.01, 0.02, 0.05, 
0.1, 0.2, 0.5, and 1 nM polyamide ImPy-P- ImPy-p~ ImPy-p- 
PyPy-p-Dp, respectively; lane 11, intact DNA. All 
reactions contain 3 ' - 32 P-end-labeled EcoRI/tfindlll 
restriction fragment from plasmid pJT-LTR (15 kcpm) , 10 
mM Tris*HCl, 10 mM KC1, 10 mM MgCl 2 , and 5 mM CaCl 2 (pH 
7.0, 24 °C). b) Autoradiogram of a gel used to separate 
the fragments generated by an affinity cleavage reaction 
using polyamide ImPy-p-ImPy-p-ImPy-p-PyPy-p-Dp-EDTA^Fe (II) 
(1-E) . Lanes 1 and 5: A sequencing lanes; lanes 2-4: 
cleavage products obtained in the presence of 0.03, 0.1, 
0.3 and 1 nM ImPy-p-ImPy-P-ImPy-p-PyPy-p-Dp-EDTA*Fe (II) , 
respectively; lane 6: intact DNA. All reactions contain 
labeled restriction fragment (7 kcpm) , 20 mM HEPES, 300 
mM NaCl, 50 ng/mL glycogen, 1 Fe(II), and 5 mM DTT (pH 
7.3, 24 °C). The sequence of the restriction fragment in 
the region of the 16 base pair target site and a model of 
the (ImPy-P-ImPy-p-ImPy-p-PyPy-p-Dp-EDTA#Fe (II) ) 2 «DNA 

complex ..- are shown along the right side of the 
autoradiogram. Line heights are proportional to the 
observed cleavage intensity at the indicated base. 

The method used for determining association 
constants involves the assumption that [L] tot * [L] free , 
where [L] free is the concentration of polyamide free in 
solution (unbound) . For very high association constants 
this assumption becomes invalid, resulting in 
underestimated association constants. In the experiments 
described here, the DNA concentration is estimated to be 
- 5 pM. As a consequence, apparent association constants 
greater than 1-2 x io 10 M' 1 represent a lower limit of the 
true association constant. The binding data were well- 
fit by a cooperative binding isotherm, consistent with 
formation of a cooperative 2:1 complex. To provide 
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further evidence that ImPy-p. ImPy-p-ImPy-p-Pyp y .p_ Dp binds 
as an extended dimer, an affinity cleavage experiment was 
carried out with the polyamide-EDTATe (II) conjugate of 
ImPy-p-ImPy-p-imPy-p-PyPy-p-Dp shown in Fig. 39b . 
Cleavage was observed at each end of the match sequence, 
consistent with a dimeric, antiparallel binding mode. 
With regard to sequence specificity, there is a proximal 
two-base pair mismatch site, 5 ' -cAGATGCTGCATATa - 3 ' , to 
the 5' side of the 32 P- labeled strand which is bound with 
at least 35-fold lower affinity than the match site. 
However, other mismatch sites on the restriction fragment 
are bound with 10-20-fold lower affinity, revealing 
limitations of this first effort at 16 base pair 
recognition. Undoubtedly there is ample room for further 
optimization of sequence specificity. 

The high binding affinity and the affinity cleavage 
pattern observed for the 16 base pair polyamide-DNA 
complex inidcates that 8 pairs of amide residues form a 
fully overlapped core which properly positions the 6 
Im/Py pairs for recognition of 6 G,C base pairs and 2 p/p 
pairs for recognition of 2 A, T base pairs. Polyamides 
composed of 2 -ring subunits connected by p- alanine appear 
to be isohelical with B-DNA, and allow placement of 
imidazole residues at any ring position, thus providing a 
generalizable motif for recognition of predetermined DNA 
sequences. The data presented herein allows for the 
design of polyamides capable of binding 16 base pairs of 
DNA at subnanomolar concentrations of suitable size for 
permeating cells (i.e., MW - 1,200). 

The references described throughout this 
specification are fully incorporated by reference. While 
a preferred form of the invention has been shown in the 
drawings and described, since variations in the preferred 
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form will be apparent to those skilled in the art, the 

invention should not be construed as limited to the 

specific form shown and described, but instead is as set 
forth in the claims. 
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I claim: 



1 . A polyamide of the formula 




or a pharmaceutical^ acceptable salt thereof where: 

R\ R a , R b , R e , R £ , R\ Ri, R n , and R° are chosen independently 
from H, CI, NO, N-acetyl, benzyl, C 2 . 6 alkyl, d. 6 alkylaraine, C x . 6 
alkyldiamine, C x . 6 alkylcarboxylate, C^ alkenyl, and C a . 6 alkynyl ; 

R 2 is selected from the group consisting of H, NH 2 , SH, CI, Br, 
F, N-acetyl, and N-formyl; 

R 3 , R d , R 1 and R q are selected independently from the group 
consisting of H, NH 2 , OH, SH, Br, Cl, F, OMe, CH 2 OH, CH 2 SH, CH 2 NH 2 ; 

R 4 is -NH(CH 2 ) 0 _ 6 NR 5 R 6 or NH(CH 2 ) r CO NH <CH 2 ) 0 _ 6 NR 5 R S or NHR 5 or 
NH(CH 2 ) r CONHR 5 , where R s and R 6 are independently chosen from H, Cl, 
NO, N-acetyl, benzyl, c x . 6 alkyl, C^ 6 alkylamine, C^ alkyldiamine, C, 
6 alkylcarboxylate, d_ 6 alkenyl, C X . 6 L, where L groups are 
independently chosen from biotin, oligodeoxynucleotide, N- 
ethylnitrosourea, fluorescein, bromoacetamide, iodoacetamide , DL-a- 
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lipoic acid, acridine, ethyl red, 4 - (psoralen- 8 -yloxy) -butyrate , 
tartaric acid, (+) -a- tocopheral , and d. 6 alkynyl, where r is an 
integer having a value ranging from 0 to 6; 

X, X a , X b , X% X f X 1 , X\ X n , and X° are chosen independently from 
the group consisting of N, CH, COH, CCH 3/ CNH 2 , CC1, CF; and 

a, b, c, d, e, f, i, j, k, and m are integers chosen 
independently, having values ranging from 0 to 5. 

2. The polyamide of claim 1, wherein a is 3, b, g, m, n, q are 
0, o is 4 , and h is 2. 

3. The polyamide of claim 2, wherein R 2 and R 3 are H, R a , R 1 and 
R° are CH 3 , X is N and X a and X° are CH. 
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4 . A polyamide of the formula 




or a pharmaceutical].}/- acceptable salt thereof where: 

R 1 , R a< *- ra! and R b( i- m) are chosen independently from H, CI, NO, N- 
acetyl, benzyl, C 4 . § alkyl, d. fi alkylamine, Cj. 6 alkyldiamine, d. 6 
alkylcarboxylate, C z _ 6 alkenyl, and d. 6 alkynyl; 

R 2 is selected from the group consisting of H, NH 2/ SH, CI, Br, 
F, N-acetyl, and N-formyl; 

R f<n, and R c(k ' m > are selected independently from the group 
consisting of H, NH 2 , OH, SH, Br, CI, F, OMe, CH 2 OH, CH 2 SH, CH 2 NH 2 ; 

R 4 is -NH(CH 2 ) 0 . 6 NR S R 6 or NH(CH a ) r CO NH (CH 2 ) 0 _ 6 NR 5 R 6 or NHR 5 or 
NH (CH 2 ) r CONHR 5 , where R s and R 6 are independently chosen from H, CI, 
NO, N-acetyl, benzyl, C x . 6 alkyl, c,.« alkylamine, C^ € alkyldiamine, c x . 
6 alkylcarboxylate, C^ alkenyl, C^L, where h groups are 
independently chosen from biotin, oligodeoxynucleotide, N- 
ethylnitrosourea, fluorescein, bromoacet amide, iodoacetamide, DL-a- 
lipoic acid, acridine, ethyl red, 4- (psoralen- 8 -yloxy) -butyrate, 
tartaric acid, (+) -a-tocopheral , and C^ 6 alkynyl, where r is an 
integer having a value ranging from 0 to 6; 

X, x a<1 ' n) and x b(j * m) are chosen independently from the group 

consisting of N, CH, COH, CCH 3 , CNH 2 , CC1, CF; and 

a, b, c, d, e, f, g, h, i, j, k, 1, m, n, o and p are integers 
chosen independently, having values ranging from 0 to 5 . 



183 



WO 98/49142 



PO7US98/06997 



5. The polyamide of claim 4, wherein a is 3 , b is 4 , d is 1 , 
and c is 0, o is 4 . 

6. The polyamide of claim 5, wherein R 2 and R £{1) are H, R 1 , 
R a(i ' m! , and R b(3 ' ro) are CH 3 , X is N and X ali ' n) and X b,j ra> are CH. 

7. The polyamide of claim 4 including a core sequence selected 
from the group consisting of 

ImPyPyPy-y- PyPyPyPy , 

Py Py ImPy -y- PyPy PyPy , 

ImPyPyPy - y - ImPyPyPy , 

Py ImPyPy -y- PylmPyPy , 

ImPy ImPy -y- PyPyPyPy , 

ImlmPyPy-y-PyPyPyPy , 

ImlmlmPy-y- PyPyPyPy, 

ImlmPy Py -y- ImPyPyPy , 

ImPyPyPy -y-ImlmPyPy, 

ImlmPyPy-y-ImlmPyPy, 

ImPy ImPy-y- ImPy ImPy, . 

ImlmlmPy -y- ImPy PyPy Py , and 

Imlmlmlm-y-PyPyPyPy, where the core sequence is defined as the 
formula of claim 4 minus R 4 , Im is N-methyl- imidazole carboxamide, Py 
is N-methylpyrrole carboxamide, and y is y-aminobutyric acid. 

8. The polyamide of claim 4, wherein R 4 is (3-Dp, where (5 is |5- 
alanine and Dp is dimethylaminopropyl amide . 

9. The polyamide of claim 4 having an association constant of 
at least 10 s M' 1 . 

10. The polyamide of claim 4 wherein the ratio of the the 
association constant at match sites to the association constant at 
mismatch sites greater than one. 
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11. The polyamide of claim 4 wherein the ratio of the the 
association constant at match sites to the association constant at 
mismatch sites is greater than five. 

12. A polyamine including a core sequence selected from the 
group consisting of ImPyPyPy -y- PyPyPyPy, PyPylmPy-y- PyPyPyPy , 
ImPyPyPy-y-lmPyPyPy, PylmPyPy-y- PylmPyPy , ImPylmPy-y- PyPyPyPy , 
ImImPyPy-y-PyPyP y py, ImlmlmPy -y- PyPyPyPy , ImlmPyPy-y- ImPyPyPy , 
ImPyPyPy-y-imlmPyPy, ImlmPyPy-y- ImlmPyPy , ImPylmPy-y- ImPylmPy , 
ImImImPy~y- ImPyPyPyPy, Imlmlmlm-y- PyPyPyPy , Im-0-PyPy-y- Im-p-PyPy , im- 
p-Imlm-y-Py-p-PyPy, Im-p-ImPy-y-Im-p-ImPy, ImPyPyPyPy-y-imPyPyPyPy p 
ImlmPyPyPy-y-lmPyPyPyPy, ImPylmPyPy-y- ImPyPyPyPy , ImlmPylmlm-y- 

Py PyPyPyPy, 

ImPy PylmPy-y-ImPyPy ImPy, ImPy-p-PyPy-y- ImPy-p-PyPy, Imlm-p- Imlm-y- 
PyPy-p-PyPy, ImPy- P~ ImPy -y- ImPy- p- ImPy ImPy -p- PyPy Py- y- imPyPy-p-PyPy, 
Imlm-P-PyPyPy-y-PyPyPy-p-PyPy, ImPy-P- ImPy Py-y- ImPy Py-P- PyPy , imlm-p- 
PyPyPy-y-ImltnPy-P-PyPy, ImPy- p- Py PyPy -y-Py PyPy- p- ImPy , imPyPyPyPyPy-y- 
ImPyPyPyPyPy, ImPyPy-p-PyPy-y- ImPyPy-p-PyPy , ImPyPyPy -p- Py-y- i m _p- 
PyPyPyPy, ImlmPyPyPyPy-y- ImlmPyPyPyPy , Im- p- PyPyPyPy -y- Im-p -PyPyPyPy, 
ImPyPyPy-p-Py-y-i m pyPyp y -3-py / imPylmPyPyPy-y- imPy PyPyPyPy , ImPyPy-p- 
Py Py-y- ImPy-P-Py PyPy, ImPyPyPy Py- p -y- ImPyPyPyPy - p , ImPy-P-ImPyPy-y- 
ImPy-p-ImPyPy, Im- P-Py PyPy Py-y- ImPy PyPy-P-Py, Im-P~ ImPyPyPy -y - 
ImPyPyPy-p-Py, ImPyPy-P-PyPyPy , ImlmPy-p- PyPyPy , Imlmlm-p-PyPyPy , 
ImPyPyPyPy- P~Py PyPy, ImPy PyPy >P- PyPy Py, ImPy Py - P - Py Py Py Py Py , 
ImPyPyPy- p- PyPyPyPy, ImlmPyPy- p- PyPyPyPy , imlmlmPy-p- PyPyPyPy, 
ImPyPyPy- p- ImPyPyPy, ImlmPyPy-p-imPyPyPy , ImlmPyPy Py - p - PyPy PyPy Py , 
ImlmlmPyPy-P-PyPyPyPyPy, Imlm-p-pypy-p-pypy-p-pypy, ImlmPy-p-PyPyPy- 
p-PyPyPy, ImlmPyPy -p- Py-P -PyPyPyPy, IniPyPy-y-ImPyPy-p-PyPyPy , 
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ImPyPy-y-PyPyPy-p-PyPyPy, PylmPy-y- ImPyPy-P-PyPyPy , PylmPy-y- ImPyPy-P- 



ImPyPyPy-y-ImlmlmPy-p-PyPyPyPy, ImlmPyPy-y-ImlmPyPy-P-PyPyPyPy , and 
ImlmPyPy-y-PyPyPyPy-p-PyPyPyPy, where the core sequence is defined as 
the formula of claim 4 minus R 4 , Im is N-methyl- imidazole 
carboxamide, Py is N-methylpyrrole carboxamide, P is P-alanine and y 
is y-aminobutyric acid. 

13. The polyamide of claim 12, wherein R 4 is P-Dp, where p is 
P-alanine and Dp is dimethylaminopropylamide . 

14. The polyamide of claim 12 having an association constant of 
at least 10* M" 1 . 

15. The polyamide of claim 12 wherein the ratio of the the 
association constant at match sites to the association constant at 
mismatch sites greater than one. 

16. The polyamide of claim 12 wherein the ratio of the the 
association constant at match sites to the association constant at 
mismatch sites is greater than five. 

17. The polyamide of claim 4 suitable for recognition of 6 base 
pair sequences of double stranded DNA including a core sequence is 
selected from the group consisting of Im-p-PyPy-y-Im-P-PyPy , Im-P~ 
Imlm-y-Py-p-PyPy, and 

Im-P-ImPy-y-Im-P-ImPy, where the core sequence is defined as the 
formula of claim 4 minus R 4 , Im is N-methyl -imidazole carboxamide, Py 
is N-methylpyrrole carboxamide, p is p-alanine and y is y-aminobutyric 
acid. 

18. The polyamide of claim 17, wherein R 4 is P-Dp, where p is 
p-alanine and Dp is dimethylaminopropylamide. 



PyPyPy-P-PyPyPy, ImlmPy-y- ImPyPy-p-PyPyPy , ImPyPy-y- ImPyPy-G-PyPyPy, 
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19. The polyamide of claim 17 having an association constant of 
at least lO'M" 1 . 

20. The polyamide of claim 17 wherein the ratio of the the 
association constant at match sites to the association constant at 
mismatch sites greater than one. 

21. The polyamide of claim 17 wherein the ratio of the the 
association constant at match sites to the association constant at 
mismatch sites is greater than five. 

22. The polyamide of claim 4 suitable for recognition of 7 base 
pair sequences of double stranded DNA including a core sequence 
selected from the group consisting of ImPyPyPyPy-y-imPyPyPyPy, 
ImlmPyPyPy-y-ImPyPyPyPy, 

ImPylmPyPy-y- ImPyPyPyPy , 
ImlmPylmlm-y-PyPyPyPyPy, and 

ImPyPylmPy-y-ImPyPylmPy, where the core sequence is defined as the 
formula of claim 4 minus R\ im is N-methyl -imidazole carboxamide, Py 
is N-methylpyrrole carboxamide, p is P-alanine and y is y-aminobutyric 
acid . 

23. The polyamide of claim 22, wherein R 4 is P-Dp, where P is 
P-alanine and Dp is dimethylaminopropylamide . 

24. The polyamide of claim 22 having an association constant of 
at least 10'M* 1 . 

25. The polyamide of claim 22 wherein the ratio of the the 
association constant at match sites to the association constant at 
mismatch sites greater than one. 

26. The polyamide of claim 22 wherein the ratio of the the 
association constant at match sites to the association constant at 
mismatch sites is greater than five. 

27. The polyamide of claim 4 which is capable of forming a 
hairpin polyamide for recognition of 7 base pair sequences and is 
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selected from the following group consisting of core sequence 
composition: 

ImPy-p-PyPy-y-imPy-p-PyPy, 
Imlm-p- Imlm-y-PyPy-P-PyPy, and 
ImPy-P-lmPy-y-ImPy-p-ImPy. 

wherein Im is N-methyl- imidazole carboxamide, Py is N- 
methylpyrrole carboxamide, and y is y-aminobutyric acid, and p is p- 
alanine . 

28. The polyamide of claim 4 which is capable of forming a 
hairpin polyamide for recognition of 8 base pair sequences and is 
selected from the following group consisting of core sequence 
composition : 

ImPy-P-PyPyPy-y-I m pypy-p_pypy^ 
ImIm-p-PyPyPy-y-PyPyPy-p_pypy # 

ImPy-p-lmPyPy-y-lmPyPy-P-PyPy, 
Imlm-P-PyPyPy-y-imlmPy-p-PyPy, and 
ImPy-P-PyPyPy-y-pyPypy-p- ImPy . 

wherein Im is N-methyl -imidazole carboxamide, Py is N- 
methylpyrrole carboxamide, and y is y-aminobutyric acid, and p is 0- 
alanine . 

29. The polyamide of claim 4 which is capable of forming a 
hairpin polyamide complex and is selected from the group consisting 
of the following: 

ImPyPyPyPyPy-y- ImPyPyPyPypy 
ImPy Py - P - Py Py - y - IniPy Py - p - py py 
ImPyPyPy-p-Py-y-l m -p-PyPyPyPy 
ImlmPyPyPyPy-y- imlmPyPyPyPy 
Im- P - PyPyPyPy -y- l m _ p - pypypypy 
ImPyPy Py- p - py-y- imPyPyPy - p - Py 
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ImPylmPyPyPy-y- ImPyPyPyPyPy 
ImPy Py - p - PyPy - y - imPy - P - Py Py Py 
ImPy Py Py Py - p - y - ImPy Py Py Py - p 
ImPy- P-ImPyPy-y- ImPy- p- ImPy Py 

wherein Im is N-methyl -imidazole carboxamide, Py is N- 
methylpyrrole carboxamide, and y is y-aminobutyric acid, and p is P- 
alanine . 

30. The polyamide of claim 4, which is capable of forming a 
hairpin polyamide for recognition of 8 base pair sequences and is 
selected from the following group consisting of core sequence 
composition : 

I m - P - Py Py Py Py - y - 1 mPy Py Py - p - Py , and 
Im- P - ImPyPyPy-y- ImPyPy Py - p~ Py . 

wherein Im is N-methyl-imidazole carboxamide, Py is N- 
methylpyrrole carboxamide, and y is y-aminobutyric acid, and p is p- 
alanine . 

31. The polyamide of claim 4 which is capable of forming a 
"overlapped" or "slipped" polyamide complex and is selected from the 
group consisting of the following: 

ImPyPyPy 

ImPyPyPyPy 

ImPyPyPyPyPy 

ImPyPyPyPyPyPy 

ImPyPyPyPyPyPyPy 

ImPyPyPyPyPyPyPyPy 

ImlmPyPy 

ImPy ImPy 

Im-p-ImPy 

ImlmPyPyPyPy 
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ImlmlmPyPyPy 

ImPyPy >p- PyPyPy 

ImlmPy-p-pypypy 

Imlmlm-p- PyPyPy 

ImPyPy-G-PyPyPy 

ImPyPy-G- ImPyPy 

ImPyPy - P - ImPyPy 

ImPyPy- C5- ImPyPy 

ImPyPy- C6 - ImPyPy 

ImPyPy -C7 - ImPyPy 

ImPyPy -C8 - ImPyPy 

ImPyPy -C9 - ImPyPy 

ImPy PyPyPy - p - PyPyPy 

ImPyPy Py - p - PyPyPy 

ImPyPy- p-PyPy PyPyPy 

wherein Im is N-methyl- imidazole carboxamide, Py is li- 
me thy Ipyr role carboxamide, p is .p-alanine, C5 is 5-aminopentanoic 
acid, C6 is 6-aminohexanoic acid, C7 is 7-aminoheptanoic acid, C8 is 
8-aminooctanoic acid, C9 is 9-aminononanoic acid. 

32. The polyamide of claim 4 which is an 8-ring beta alanine 
extended polyamide and is selected from the group consisting of the 
following core sequence composition: 
ImPyPy Py- p - Py PyPyPy 
ImlmPyPy- p - PypyPy py 
ImlmlmPy-p-Py PyPyPy 
ImPyPyPy-p- ImPyPy Py 
ImlmPy Py - p - ImPyPyPy 

wherein Im is N-methyl -imidazole carboxamide, Py is n- 
methylpyrrole carboxamide, and P is P-alanine. 
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33. The polyamide of claim 4 which is an 8 -ring beta alanine 
extended polyamide and is selected from the group consisting of the 
following core sequence composition: 

ImlmPy Py Py - p - PyPyPyPyPy 
ImlmlmPyPy-p-PyPyPyPyPy 
Imlm-P-PyPy-p-PyPy-p-PyPy 
ImlmPy - p - Py Py Py - p - PyPypy 
ImlmPyPy - P - Py - p - PyPyPyPy 

wherein Im is N-methyl -imidazole carboxamide, Py is N- 
methylpyrrole carboxamide, and P is P-alanine. 

34, The polyamide of claim 4, which is capable of forming a 9- 
ring or 12-ring extended hairpin polyamide complex for recognition of 
9 or 11 base pair sequences and is selected from the following group 
consisting of core sequence composition: 
ImPyPy-y-lmPyPy-p-PyPyPy, 

ImPyPy - y- Py Py py- p - PyPy Py , 
PylmPy-y-imPyPy-p-PyPyPy, 
ImlmPy - y - ImPyPy - p - PyPyPy , 
ImPyPy-y-lmPyPy-G- PyPypy, 
ImPyPyPy-y- ImltnlmPy- p- PyPyPyPy , 
ImlmPyPy-y-imlmPyPy-p-PyPyPyPy, and 
ImlmPy Py-y- PyPyPyPy- p - PyPyPyPy . 

wherein Im is N-methyl -imidazole carboxamide, Py is N- 
methylpyrrole carboxamide, and y is y-aminobutyric acid, and p is p- 
alanine, and G is glycine. 

35. A composition comprising the polyamide of claim 4 and a 
pharmaceutically acceptable excipient . 
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FIG. 16 



5 ' -W G W W W C W-3' 5'-W G G W W C W-3 ' 5 ' -W G W G W C W-3 * 

t<XXKX »*OOOn #0#0<X 

iyocKxxy^ mooor +xx>ooo# ; 

3 ' -W C W W W G W-5' 3*-W C C W W G W-5 ' 3 ' -W C W C W G W-5' 

K a = 1x10 10 M' 1 Ka^SxIO 9 !^" 1 K a < 1 x 10 8 M" 1 

5 ' -W G C W G C W-3' 5'-W G G W G G W-3' 5 ' -W G W G W C W-3' 

at^tWfl-s- 3 + ^WW^ w . 5 . 3 t$?|?^t ) w . 5 . 

K a <1«108M-' K a < 1X108M-1 K a = 1 x tO 9 M"1 

5 ' -W G C W G C W-3' 5 ' -W G G W G G W-3' 5'-W G W W W G W-3' 

•OO+Os ++<>++^ tOOOfs 

3 -W C G W C G W-5' 3 -W C C W C C W-5' 3 ' -W C W W W C W-5' 

Ka= 2x10 10 M- 1 K a= 5x10 9 W 1 K a s5x10 9 M- 1 
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FIG. 17 
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282 bp 



1 pM ImlmPyPy- y-lmlmPyPy- p-Dp-EDTA (1-E) 



I Hill 

5-CAGTCAT<ra5GOT6ACTQTCAGTCAT65CC7TbA^ 

>''GTCAGTA ^CCCTAy TQACAQTCAQTA< fCQQTAy TQACAQTCAQTA< ^GCGTAp TQAC-6' 

f f 

1 pM ImPylmPy- y-lmPylmPy- P-Dp-EDTA (2-E) 



I. till. 

VTf SCCA I pACTGTCAGTCATGCGCA I pi 
rAt fCGQTAy TGACAOTCAGTA yjCGTA^ I 



Mill • 



S'-CAGTCATfiCBGATpACTGTCAGTCAT* 

t-gtcagta4ccc7a4tgacagtcagt> 



5 \iM Imlmlmlm- y p y p y p y p y- P-Dp-EDTA (3-E) 

ill. 

5'XAGTCAT(£CCiGXT^CTGTCAGTCATGGTCATbACTGTCAGTCATGCi 
a-.GTCAGTAicCCTA^TGACAQTCAOTA^CQQTA^TQACAGTCAGTA^GI 



ACTG-3* 
;TGAC-5' 



III' 



3ACTG-3* 
1TGAC-5' 
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FIG. 18 
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FIG. 19 
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FIG. 21 
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FIG. 24 
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FIG. 25 
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A. 



B. 



FIG. 26 
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OjN^ 5 ^ O O | 

I mPy Py Py - y - Py Py Py Py - p - Dp - NBD 
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I mPyPyPy -y- Py Py Py Py - p - Dp - Dan sy 1 
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FIG. 26 




i m p y PyPy- r lmPyPyPy-p-ED-TO 
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FIG. 29 
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Target Sequence 
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restriction 
digest 



^ Restriction Fragments 



f ad 
~ co 



add poiyamide-biotin 
conjugate 



Poiyamide ' 




Biotin 



add streptavidin- 
magnetic beads 




H- Biotin 

Streptavidin Remove Supernatant 



Magnetic Bead 



Change sart conditions 




Magnetic Bead 
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FIG. 30 
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Figure 31 
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FIG. 32 



5 ' -A GATGCTGCATAT AAGCAGCTGC 1 ! 
3'-T C[T]A CGACGAATA.TTCGTCGACGi 

Mismatch Match 
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FIG, 33 
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FIG. 34 



(2-p) x motif: a generalizable polyamide motif 



Hairpins 

5'-T G C T G C A-3' 
•<X>*CK 

3' -A C G A C G T-5' 

K. = 2 x 10«> 

S'-T G GTGG A-3' 

XXXKX>0K- 

3' -A C C A C C T-5' 
Ka = 8 X 109 

5' -T G 7 T A C A-3' 

3' -A C A A T G T-5' 
Ka = 3 X 109 



Extended dimers 



5'-AGCAGC?GC T-3' 
tOO#OOt(XH 

3'-? CGTCGACG A-5' 



5'-A TAAGCAGCTGCTT? T-3' 
•O<>#0^#0^>00<« 

KK)CK>0#<K>#<>0« 

3'-T ATTCGACGACGAAA A-5' 



Extended hairpin dimers 

5'-A GCAGCTGC T-3' 



3'-T CGTCGACG A-5' 
5'-A A G C A GCTGCT T-3' 



3'-TTCGACGACGA A-5' 
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FIG. 35A 
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FIG. 35B 



5'--C 



H 2 N* 



3'-G 




A G 



T C 



TGCTGCATA T-3' 



-OfOofO- 




NH. 



ACGACGTAT A-5 



Ka <10 8 M-i 



5 ' -A TAAGCAGC T G C TTT T-3'" 
3 ' -T ATACGACGACGAAA A-5" 

Ka = 2 x 10 9 M-i 
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FIG. 36 




1 ImPyPy-p-p-ImlmPy-Y-PyPy/Py-p-Dp 




2 (n=3) ImPyPy-t-ImlmPy-Y-PyPyPy-P-Dp 

3 (n=2) lmPyPy-e-ImImPy-Y-PfyP>'P)'-P-Dp 

4 (n=l) ImPyPy-6-ImImPy-Y PyP> Py-P-Dp 

5 (n=0) imPyPy-Y-imimPy-Y-P'yPyPy-p-Dp 
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FIG. 37 



N-termina! 
extension 



-A A C 


C A A G T • 

<xx± #ck- 


C T T G G T A- 


-T T G 


G T T C A • 


G A A C C A T- 


ImPyPy-X-ImlmPy-. 


Y-PyPyPy-p-Dp 


X 


Affinity 


Specificity 


Y 


lxlO 7 


none 


C5 


4xl0 8 


10-20-fold 


C6 


2xl0 8 


10-15-fold 


C7 


2xl0 8 


2-fold 


C8 


2xl0 8 


10-fold 


P-fJ 


3xl0 8 


2-fold 


P-C5 


IxlO 8 


2-fold 
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FIG. 38 
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FIG. 39 
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FIG. 42 



5 ' _T 1SJL2A A-3 ' 250x increasse 
tKX^OO^ ^ 

3 ' -A C G A C G T-5' 
Ka = 8 X 107 



5'-T G C T G C A-3' 

•CX>#CK 

3 ' -A C G A C G T-5' 
Ka = 2 X 1010 



5'-t ^1X1£*~ 2 ' >150x incresase 
.x>00000 ) ESSSS=^ ^ 

3 ' -A C C A C C T-5' 
K, < 5 X 107 



5'-T G G T G G 


A-3' 


♦>oooooo 

3' -A C C AC C 


T-5' 


Ka = 8 X 109 





5 ' -T G T T A C 

toooo 

•x>oooo# 


A-3' 


15x decrease 


5 ' -T G T T A C 

toooo 

♦>ooooo# 

3' -A C A A T G 


A-3' 
T-5' 


3' -A C A A T G 
Ka = 5 X 1010 


T-5' 




Ka = 3 X 109 
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FIG. 43 



TATA box 



5'-TTGGCTC 



3'-AACCGAG 



T T T A A A 



A A A T T T 



G G G C G C G G-3 



C C C G C G C C-5' 



R2-1 Ka = 1 x 109 M 



5'-TTGGCTC 
3'-AACCGAG 



T T T A A A i 
A A A T T T 



G G G C G C G G-3' 
C C C G C G C C-5' 



R2-2 Ka <: 10 8 IVH 
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